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ASSAYS, THERAPEUTIC METHODS AND MEANS 
The present invention relates to screening methods, 
peptides, mimetics, and methods of use based on the 
surprising discovery and characterisation of an 
interaction between known proteins, and thus numerous 
cellular processes of interest in therapeutic contexts . 
The proteins in question are ATM and p53, and the 
inventors have found that ATM phosphorylates p53 at a 
number of specific sites. This interaction is observed 
with other related proteins with associated kinase 
activity, in particular ATR and DNA-PK, and other 
proteins having similar phosphorylation sites to p53 . 
Further aspects of the present invention are founded on 
the discovery that ATM binds DNA and that such binding 
has an effect on phosphorylation of p53 by ATM. 

Ataxia-telangiectasia (A-T) is a human autosomal 
recessive disorder characterised by a number of 
debilitating symptoms, including a progressive cerebellar 
degeneration, occulocutaneous telangiectasia, growth 
retardation, immune deficiencies and certain 
characteristics of premature ageing (reviewed in Jackson, 
1995; Meyn, 1995; Shiloh, 1995). A-T patients exhibit an 
approximately 100 -fold increased incidence of cancer, 
with patients being particularly predisposed to 
malignancies of lymphoid origin. Furthermore, A-T 
heterozygotes, which comprise -1% of the population, are. 
reported to exhibit a higher incidence of breast cancer 
(Easton, 1994; Meyn, 1995), although this remains 
controversial (Fitzgerald et al . , 1997). At the cellular 
level, A-T is characterised by a high degree of 
chromosomal instability, radioresistant DNA synthesis, 
and hypersensitivity to ionising radiation (IR) and 
radiomimetic drugs. In addition, A-T cells are defective 
in the radiation induced Gl-S, S, and G2-M cell cycle 
checkpoints that are thought to arrest the cell cycle in 
response to DNA damage in order to allow repair of the 
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genome prior to DNA replication or mitosis (Halazonetis 
et al., 1993; Beamish et al., 1994; Beamish and Lavin, 
1994; Khanna et al., 1995; Barlow et al.. 1996; Xu and 
Baltimore, 1996) . A-T cells exhibit deficient or 
severely delayed induction of p53 in response to IR 
(Kastan et al., 1992; Khanna and Lavin, 1993; Lu and 
Lane, 1993; Xu and Baltimore, 1996). p53 mediated 
transcriptional activation of p21/WAPl/CIPl and Gadd45, 
and the subsequent inhibition of Gl cyclin-dependent 
kinases, are also defective in A-T cells following IR 
exposure (Artuso et al., 1995; Khanna et al., 1995). Lu 
and Lane, 1993, however, reported very lijttle difference 
in the p53 response from normal and A-T cells. 

15 Furthermore, yeast have an ATM homologue (Mecl) but do 

not have p53 (Goffeau et al.). The best data for a 
possible substrate for Meclp is Spkl/Rad53 (Sun et al; 
Sanchez et al. ) 

20 The gene mutated in A-T patients, termed ATM (A-T 

mutated) , has been mapped and its cDNA cloned (Savitsky 
et al., 1995a; Savitsky et al., 1995b). Sequence 
analyses reveal that the ATM gene encodes a -350 kDa 
polypeptide that is a member of the phosphatidyl inositol 
25 (PI) 3 -kinase family of proteins by virtue of a putative 

kinase domain in its carboxyl -terminal region (Savitsky 
et al., 1995a; Savitsky et al., 1995b). Classical PI 
3 -kinases, such as PI 3 -kinase itself, are involved in 
signal transduction and phosphorylate inositol lipids 
that act as intracellular second messengers (reviewed in 
Kapeller and Cantley, 1994). ATM bears sequence 
similarity with a subset of the PI 3 -kinase protein 
family that comprises proteins which, like ATM, are 
involved in cell cycle control and/or in the detection 
35 and signalling of DNA damage (for reviews see Hunter, 

1995; Keith and Schreiber, 1995; Zakian, 1995; Jackson, 
1996). Included in this sub-group- are Saccharomyces 
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cerevisiae Torlp and Tor2p and their mammalian homologue 
: FRAP, which control progression into S-phase and, at 
•least in part, function by regulating translation (Brown 
and Schreiber, 1996). Also in this sub-group is the DNA 
5 dependent protein kinase (DNA-PK) catalytic subunit 

(DNA-PKcs) , defects in which lead to sensitivity to IR 
and an inability to perform site-specific V(D)J 
recombination (reviewed in Jackson and Jeggo, 1995; 
Jackson, 1996) . Other members of the ATM sub-group of 
10 the PI 3 -kinase family that have been identified include 

S. cerevisiae Tellp and Meclp, together with the Meclp 
homologues of Schizosaccharomyces pombe (^ad3), 
Drosophila melanogaster (mei-41) and humans FRPl/ATR; 
(Keith and Schreiber, 1995; Zakian, 1995; Jackson, 1996). 
As with ATM, defects in these proteins lead to genomic 
instability, hypersensitivity towards DNA damaging agents 
and defects in DNA damage - induced cell cycle checkpoint 
controls. 



15 



ATM is most similar to S. cerevisiae Tellp, which has not 
been shown to have any biochemical function so far 
(identity and similarity are 45% and 66%, respectively) . 
ATM is much further diverged from DNA-PKcs (28% identical 
and 51% similar) , with essentially the same homology to 
PI 3 -kinase (a bona fide lipid kinase: 24% identical and 
51% similar) . Thus, from the sequence comparisons alone, 
one could not predict that ATM would be a protein kinase 
akin to DNA-PKcs or a lipid kinase akin to PI 3-kinase. 

Although genetic data indicate an involvement of ATM- like 
proteins in DNA damage recognition and its repair, the 
mechanisms by which these proteins function are not well 
understood. Much is known about the clinical symptoms 
and cellular phenotypes that arise from mutations in ATM, 
but little is known about the mechanisms by which the ATM 
protein functions. Recent studies have revealed that, 
like DNA-PKcs, ATM is expressed ubiquitously and is 



BNSOOCIO: <WO__^9904266A2_L> 



wo 99/04266 



PCT/GB98/02115 



4 



localised predominantly in the cell nucleus (Chen and 
Lee, 1996; Lakin et al.. 1996; Brown et al . , 1997,. 
Watters et al., 1997). 

The realisation that ATM is a member of the PI 3 -kinase 
family has suggested to some that the primary function of 
ATM might phosphorylate inositol phospholipids. Savitsky 
et al (1995 Science 268, 1749-1753), for example, do not 
discuss protein phosphorylation. Indeed, several lines 
of evidence suggest that ATM might have functioned in a 
very different way from that which we have established 
herein. For example, defective protein tyrosine 
phosphorylation and calcium mobilization in response to 
the triggering of B-cells and T-cells of A-T patients 
support the idea of defects in intra-cytoplasmic 
signalling pathways in A-T cells (cited in the Savitski 
Science paper 1995) . These data are provided in the 
paper Khanna et al (1997; J. Biol. Chem.). This paper 
also summarises a variety of other data suggesting 
different ways in which ATM might function. 

Savitsky et al (Science 1995) state that the 
insulin-dependent diabetes observed in some A-T patients 
could reflect ATM acting in an analogous way to PI 
3 -kinase affecting glucose transport by insulin. They 
also discuss PI 3 -kinase in terms of controlling 
apoptosis as a paradigm for ATM, ie. one can explain many 
of the features of A-T by suggesting that it works 
analogously to PI 3 -kinase. 

Some A-T cells have been shown to be complemented by a 
gene called ATDC, whose product interacts with an 
intermediate filament protein called vimentin, which is 
cytoplasmic (Brzoska et al; PNAS) . They state that A-T 
cell lines have aberrantly aggregated actin filaments, 
suggesting the role of ATM lies in the cytoplasm. 
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We have purified ATM. We report that, ATM binds to DNA 
and possesses an associated protein kinase activity that 
is stimulated by DNA. Furthermore, we show that ATM 
serves as a kinase for p53 and that the sites of 
phosphorylation reside in functionally important regions 
of the p53 polypeptide. These sites are Serl5 and Thrl8. 
We also show that DNA-PK is also capable of 
phosphorylating the Serl5 and Thrl8 sites of p53, and 
that ATR phosphorylates Serl5. Further, we show that 
phosphorylation of these sites of p53 disrupts the 
interaction of p53 with Mdm-2, a protein which targets 
p53 for degradation within the cell. 

By targeting these sites, ATM may activate p53 for DNA 
15 binding and/or cause disassociation of Mdm-2, thus 

stabilising p53 (leading to increased amounts of the. 
protein) and would allow it to activate transcription. 
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Thrl8 of p53 has to our knowledge never been shown to be 
phosphorylated in vivo or in vitro. This site does not 
conform to a characterized DNA-PK consensus 
phosphorylation site. Thus, our finding of 
phosphorylation here is totally unexpected. 



Serl5 is phosphorylated by DNA-PK, but nonetheless its 
phosphorylation by ATM is also surprising, particularly 
since there are no data indicating its phosphorylation in 
reponse to DNA damage being altered in A-T cells. 

Based on this and other work described below, the present 
invention in various aspects provides for modulation of 
interaction between ATM (and ATR) and p53, particularly 
phosphorylation of p53 by ATM and ATR, and DNA binding by 
these proteins, which is further shown to have a 
potentiating effect on phosphorylation of p53. 

Various aspects of the present invention provide for the 



BNSDOCID; <W0 9904266A2,L> 



wo 99/04266 



PCT/GB98/02115 



. 6 

use of ATM (or related kinases such as ATR or DNA-PK) and 
p53, with or without DNA, in screening methods and assays 
=for agents which modulate interaction between ATM and 
p53, particularly phosphorylation of p53 by ATM. 

Further aspects provide for modulation of interaction 
between ATM, or related kinases such as ATR or DNA-PK, 
and other molecules including a phosphorylation site 
homologous to those in p53 which are phosphorylated by 
ATM, and use of these molecules in screening methods and 
assays for useful agents. For simplicity, much of the 
present disclosure refers to ATM and p53 However, 
unless the context requires otherwise, every such 
reference should be taken to be equally applicable to the 
interaction between ATM and other molecules including a 
site homologous to one of those in p53 phosphorylated by 
ATM. Similarly, based on the disclosure herein, the 
invention extends to the use of other protein kinases 
which have an associated protein kinase activity capable 
of phosphorylating sites of p53, in particular Serl5 and 
Thrl8. Typically, the protein kinase domain of these 
other kinases will share at least 30% amino acid sequence 
identity with the corresponding domain of ATM, more 
preferably at least 35% sequence identity, more 
preferably at least 40% sequence identity, more 
preferably at least 50% sequence identity, more 
preferably at least 70% sequence identity, still more 
preferably at least 90% sequence identity. Examples of 
such kinases are ATR (also known as FRPl, see Cimprich et 
al, 1996) and DNA-PKcs. 

Such molecules may be identified by various means. For 
instance, information may be obtained about residues 
which are important for p53 phosphorylation by ATM using 
alanine scanning and deletion analysis of p53 and/or 
peptide fragments, for instance the N- terminal 42 amino 
acids or so of p53, or a fragment of .around 10 amino 
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acids including the relevant site of phosphorylation. 
Mutation may be used to identify residues which affect 
phosphorylation and those which do not. When key 
residues are identified, computer sequence databases may 
be scanned for proteins including the same or similar 
pattern of residues, taking into account conservative 
variation in sequence (see below) as appropriate. 
Candidate molecules may then be used in one or more 
assays for phosphorylation by ATM (such as discussed 
below) . 

Identification of key residues for phosphorylation at any 
of the sites in p53 phosphorylated by ATM may also be 
used in the design of peptide and non-peptidyl agents 
which modulate, particularly inhibit, phosphorylation of 
p53 by ATM, as discussed further below. 

Methods of obtaining agents able to modulate interaction 
between ATM and p53 (or, it must be remembered, ATR, or a 
related protein having a similar associated kinase 
activity, and other molecules including a phosphorylation 
site homologous to one of those phosphorylated in p53 by 
ATM) include methods wherein a suitable end-point is used 
to assess interaction in the presence and absence of a 
test substance. Assay systems may be used to determine 
ATM kinase activity, ATM DNA binding and/or ATM 
interaction with one or more other molecules. For 
phosphorylation assays, full-length p53, truncated 
portions of p53, or portions of p53 fused to other 
proteins (eg. GST), or a suitable variant or derivative 
of any of these may be used. Peptide phosphorylation 
assays may be developed using peptides that correspond to 
the phosphorylated regions of p53 . The phosphorylation 
of any of the above may be assayed by any of a variety of 
procedures such as discussed below and may be adapted to 
high throughput screening approaches. Interference of 
DNA binding may be assayed but the inhibition of kinase 
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activity may be more sensitive and identify a greater 
breadth of inhibitors to DNA binding inhibition, and so 
; ' may be preferred by the skilled operator of the present 
• invention. 

5 

ATM kinase activity may be assayed for either of the two 
N-terminal p53 sites. When assaying for phosphorylation, 
DNA is preferably included in the assay system. Related 
but different screens may be set up for inhibitors and 
10 activators of the two sites of ATM-mediated 

phosphorylation event. 

Generally of most interest is modulation of the 
phosphorylation of p53 (or other molecule) by ATM. 

.15 Detailed disclosure in this respect is included below. 

It is worth noting, however, that combinatorial library 
technology provides an efficient way of testing a 
potentially vast number of different substances for 
ability to modulate an interaction with and/or activity 

20 of a polypeptide. Such libraries and their use are known 

in the art, for all manner of natural products, small 
molecules and peptides, among others. The use of peptide 
libraries may be preferred in certain circumstances. 

25 At the cellular level, A-T cells display chromosomal 

instability, radiosensitivity, are impaired in p53 
induction following treatment with ionising radiation, 
and show altered regulation of transcription factor NFkB. 
Thus, the wild- type ATM gene functions as a tumour 

30 suppressor, and is a suppressor of neurological 

degeneration and other degenerative states commonly 
associated with ageing. 

Given the results reported herein on which the present 
35 invention is based, activators and inhibitors of 

ATM-associated kinase activity may be identified and 
appropriate agents may be obtained; designed and used for 
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any of a variety of purposes: 

■ A-T Therapy. Activators of ATM or ATR function may prove 
to have utility in treating humans with A-T (discussed 
further below) . 

Modulation of immune system function. A-T patients 
display immunodeficiencies, demonstrating that ATM is 
required for generation of a fully functional immune 
system. Modulators of ATM or ATR may, therefore, be used 
in regulating immune system function. 

A JDS therapy. It has been shown that the lymphocytes of 
humans entering the final stages of AIDS have shortened 
telomeres and this may contribute to them being no longer 
able to replenish the immune system. Cells of A-T 
patients lose their telomeres more quickly than those of 
normal individuals, revealing that ATM plays a positive 
role in telomere length homeostasis. Activators of ATM 
function may, therefore, find utility in treatment of 
individuals with AIDS through lengthening the telomeres 
of senescent lymphocytes in these individuals, thus 
allowing replenishment of the immune system. 

p53 therapy. The identification of the site of p53 
phosphorylated by ATM indicates that this of extreme 
regulatory importance. Indeed, the N-terminal sites on 
p53 phosphorylated by ATM reside within the region known 
as "conserved region I" that has been shown to function 
together with flanking sequences in the interaction with 
the protein Mdm-2 (see Kussie et al 1996; Picksley et 
al., 1994; Momand et al., 1992; Chen et al . , 1993 and 
references therein) . Mdm-2 serves as a negative 
regulator of p53 by two mechanisms. First, it masks the 
p53 transcriptional activation domain, stopping p53 
activating genes (Momand et al., 1992). Second, Mdm-2 
has been shown recently to target p53 for degradation 
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within the cell (Kubbutat et al.. 1997; Haupt et al . , 
1997) . Our data therefore provide an indication that 
phosphoirylation of p53 by ATM will disrupt its 
interactions with Mdm-2, thus resulting in increased 
levels of transcriptionally active p53 . This knowledge 
may, therefore, be utilised to generate novel therapeutic 
agents that target p53 - such as small molecules that, 
through binding to mutant p53, mimic ATM-mediated 
activation of this molecule. 

Phosphorylation at any one or more of these sites may 
affect interaction of pS3 with a number of proteins. Mdm2 
is one particularly example given the location of Thrl8 
within the site on p53 to which Mdm2 binds (see e.g. Chen 
et al., (1993), Kussie et al., (1996), Picksley et al., 
(1994) and Momand et al . , (1992) for characterisation of 
this interaction) and Sens which lies immediately 
adjacent to the minimal Mdm2 binding sequence. Indeed, a 
report by Shieh et al published in October 1997 indicates 
that phosphorylation at SerlS can disrupt the p53-Mdm2 
interaction. Phosphorylation of p53 may be used to 
affect interaction of p53 with any of a number of other 
proteins, including CBP (Gu et al.; Lill et al.), 
adenovirus ElB protein, which binds within the amino 
terminal 123 amino acids of p53 (Kao et al., 1990), with 
residues Leu- 22 and Trp-23 playing an inportant role 
(Lin et al., 1994), transcription factors XPD (Rad3) and 
XPB, as well as CSB involved in strand-specific DNA 
repair (Wang et al., 1995), TFIIH (Xiao et al., 1994), 
E2F1 and DPI (O'Connor et al., 1995), Cellular 
Replication Protein A (Li and Botchan, 1993) , replication 
factor RPA (Dutta et al., 1993), WTl (Maheswaran et al.. 
1993), TATA-binding protein (Seto et al., 1992, Truant et 
al., 1992, Martin et al., 1993), and TAF(II)40 and 
TAP (II) 60 (Thut et al., 1995). 

An assay according to the present invention as discussed 
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further below may determine the role of phosphorylation 
of p53 by ATM on any of these interactions and an agent 
found to be able to modulate such phosphorylation may be 
used to disrupt or promote any of these interactions, 
e.g. in a therapeutic context. 

Modulating telomere length. A-T cells show accelerated 
rates of telomere shortening (Metcalfe et al . , 1996, 
Nature Genetics 13, 350-353). Thus, regulators of ATM 
activity may be used to control telomere length. ATM 
does not appear to be part of the telomerase enzyme 
itself (Metcalfe et al. shows that telom^jrase levels are 
normal in A-T cells; also, our data and the data of 
Pandita et al. 1995 show that A-T cells have some.what 
shortened telomeres .but do not have repressed levels of. 
telomerase) . Thus, ATM works not as part of telomerase 
but as part of a telomere length homeostatic mechanism. 
It is therefore likely that ant i -ATM drugs will work 
synergistically with anti-telomerase drugs. 

Ageing. A-T patients display enhanced rates of ageing, 
display a number of symptoms associated with increased 
age (neurological deterioration, cancers, immunological 
deficiencies etc) , and their cells show shortened 
lifespan in culture. Agents that modulate ATM activity 
may therefore be used to treat/prevent disease states 
associated with premature and normal ageing. 

Tumour/Cancer therapy. This is discussed below. Drugs 
that modulate ATM action may be used to treat A-T 
patients; treat cancer - through affecting cellular 
growth capacity by shortening cells telomeres; manipulate 
the immune system - A-T patients are somewhat 
immunodeficient; treat cancer - radiosensitization of 
tumours etc (see below) . Also, ATM modulators may be 
used to limit cell growth potential by affecting telomere 
length etc.. The linkage to p53 may 'allow p53 therapy, 
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activating p53 in cancer cells, which may lead to cell 
growth arrest and/or cell death via apoptosis or another 
' route . 

5 • Activators of ATM (or ATR. DNA-PK or related kinases) may 
be used, for example, to inhibit cell proliferation by 
activating cell cycle checkpoint arrest in the absence of 
cellular damage, which may be used in the treatment of 
tumours, cancer, psoriasis, arteriosclerosis and other 
10 hyper-proliferative disorders. Activators may be 

employed to activate p53 in cells without damaging. the 
cells. Cells of a patient may be treated so that normal 
cells (p53+) stop growing and are thus refractory to 
killing by administration of a drug that kills cells via 
interfering with cell division or DNA replication, while 
tumour cells (many of which are p53 negative) do not 
arrest and are consequently selectively killed by the 
aforementioned agents. By way of example, ATM activators 
include peptides capable of recognising and binding to 
both ATM and p53 but which do not interfere with the 
phosphorylation of the SerlS and Thrl8 sites of p53, or 
substances capable of activating ATM in a similar manner 
to the activation observed using DNA. 



15 



20 



25 



30 



35 



Cancer radiotherapy and chemotherapy may be augmented 
using agents in accordance with the present invention. 
Ionising radiation (IR) and radiomimetic drugs are used 
commonly to treat cancers, and kill cancer cells 
predominantly via inflicting DNA damage. Cells deficient 
in ATM are hypersensitive to ionising radiation and 
radiomimetics . Thus, inhibitors of the ATM will 
hypersensitise cells to the killing effects of ionising 
radiation and radiomimetics. ATM inhibitors may thus be 
used as adjuncts in cancer radiotherapy and chemotherapy. 

Cell growth capacity may be modulated e.g. in treatment 
of cancer, ageing, and AIDS. It is. established that ATM 
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plays a crucial role in controlling the length. of 
: telomeric chromosomal ends (Metcalfe et al.). Telomeric 
•ends in most normal cell types shorten at each cell 
•division, and cells with excessively shortened telomeres 
are unable to divide. Thus, telomeres are thought to 
function as a "division counting apparatus" that limits 
the proliferative capacity of most normal mammalian 
cells. Inhibitors of ATM function may, therefore, have 
utility in preventing cancer progression by limiting the 
growth potential of cancerous or pre-cancerous cells. 
Activators of ATM may be used to release senescent cells 
from growth arrest and may thus have utility in 
treatments of aged individuals. In addition, it has been 
shown recently that the lymphocytes of humans entering 
the final stages of AIDS have shortened telomeres and 
this may contribute to these cells being no longer able 
to proliferate and replenish the immune system. ATM 
activators may, therefore, result in lengthening of the 
telomeres of such cells and restoring their proliferative 
capacity. 

Interaction between ATM and p53 may be inhibited by 
inhibition of the production of the relevant protein. 
For instance, production of one or more of these 
components may be inhibited by using appropriate nucleic 
acid to influence esqjression by antisense regulation. 
The use of anti-sense genes or partial gene sequences to 
down-regulate gene expression is now well-established. 
Double- stranded DNA is placed under the control of a 
promoter in a "reverse orientation" such that 
transcription of the " ant i- sense" strand of the DNA 
yields RNA which is complementary to normal mRNA 
transcribed from the "sense" strand of the target gene. 
The complementary ant i- sense RNA sequence is thought then 
to bind with mRNA to form a duplex, inhibiting 
translation of the endogenous mRNA from the target gene 
into protein. Whether or not this i's the actual mode of 
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action is still uncertain. However, it is established 
fact that the technique works. 

Another possibility is that nucleic acid is used which on 
transcription produces a ribozyme, able to cut nucleic 
acid at a specific site - thus also useful in influencing 
gene expression. Background references for ribozytnes 
include Kashani-Sabet and Scanlon, 1995, Cancer Gene 
Therapy, 2(3): 213-223. and Mercola and Cohen, 1995, 
Cancer Gene Therapy, 2(1), 47-59. 



Thus, various methods and uses of modulators, which 
inhibit or potentiate interaction of ATM and p53 , 
particularly phosphorylation of p53 by ATM, are provided 
15 as further aspects of the present invention. The purpose 

of disruption, interference with or modulation of 
interaction between ATM and p53, particularly the 
phosphorylation of p53 by ATM may be to modulate any 
activity mediated by virtue of such interaction, as 
discussed above and further below. 



Various aspects of the present invention relate to 
modulation of interaction between ATM and DNA. Such 
interaction is established here we believe for the first 
25 time, and is further shown to have an effect on p53 

phosphorylation by ATM. It was surprising that ATM is a 
DNA binding protein, as there are data suggesting that it 
is associated with microsomal membranes in the cytoplasm 
(Watters et al, 1997 and Brown et al, 1997; show ATM is 
also present in cytoplasmic vesicles) and A-T cells have 
also been reported to be defective in signalling from the 
cell membrane in B- and T-cells (see above) . it was 
furthermore surprising that ATM would bind DNA so well. 
The purification method used and described below does not 
purify a variety of other (known) DNA binding factors, 
yet ATM is purified very selectively (about 100-fold in a 
single step) using a DNA affinity chromatography 
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procedure . 

.The present invention provides in one aspect the use of 
DNA for purifying ATM or ATR. In further aspects, the 
present invention provides for the use of DNA in assays 
for activity of ATM or ATR, particularly phosphorylation 
of p53 (or other molecule) . 

We have also purified ATM and ATR via another surprising 
route, using nitrilo-tri-acetic acid (NTA) agarose. NTA 
has 4 chelating sites for Ni^*. Another Ni^* matrix, 
iminodiacetic acid (IDA) agarose (with 3 -chelating sites 
for Ni^*) we have found to bind ATM only weakly. These 
Ni^* matrices are generally used interchangeably to purify 
proteins that chelate metal ions, usually, via a run of 
His residues (usually 6 give best binding) . ATM does, not 
have a run of 6, 5 or even 4 His residues, so it is 
surprising that ATM or ATR is purifiable by the Ni- linked 
columns. Furthermore, since the two matrices are 
generally used interchangeably, it is further surprising 
that ATM binds to the NTA well but only poorly to the IDA 
matrix. 

ATM no doubt works in concert with other factors in the 
detection and signalling of DNA damage. Indeed, although 
our data reveal that ATM possesses intrinsic 
DNA- stimulated p53 kinase function, we have observed 
repeatedly that the presence of additional polypeptides 
correlates with increased ATM activity. Thus, our most 
highly purified preparations have considerably less 
activity than preparations containing an equivalent 
amount of ATM but also possessing additional co-purifying 
polypeptides. It is likely that these serve to help 
tether ATM to the DNA and/or trigger its kinase activity 
by altering the conformation of the ATM polypeptide. 
Accordingly, references to ATM, or a protein having a 
associated kinase activity, include l)oth purified ATM (or 
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the related protein) and ATM (or the related protein) in 
combination with associated polypeptides or co- factors 
present in preparations as obtainable by the methods 
described herein. 



Assays according to the present invention may be used in 
the identification of such additional polypeptides, for 
example by assaying for protein fractions that stimulate 
ATM activity. The use of ATM or ATR in identifying 
and/or obtaining cof actors which (e.g. naturally) enhance 
its kinase activity is further provided by the present 
invention. ATM activity may under certain circumstances 
be masked by one or more factors (see discussion section 
below) . Accordingly, the present invention also provides 
for the use of ATM in identifying and/or obtaining such 
factors. 

Protein or other co-factors of ATM, e.g. which enhance 
ATM kinase activity, may be used in the design of 
inhibitors of this, providing another route for 
modulating ATM activity. This may similarly be used to 
provide a route to deriving agents that activate ATM, 
e.g. by inhibiting one or more repressors of ATM 
activity. 

Brief Description of the Figures 

Figure 1: ATM binds to DNA. (A) ATM binds to a dsDNA 
oligonucleotide. HeLa nuclear extract was bound to 
either streptavidin iron oxide beads (-DNA) or 
streptavidin iron oxide beads bearing a 50-mer ds DNA 
oligonucleotide (+DNA) . After extensive washing, ATM was 
eluted from DNA in 500 mM KCl . Eluted proteins were 
subjected to 7% SDS-PAGE and ATM visualised by Western 
blotting using ATM.B antiserum. (B) Binding of ATM is 
dependent on DNA length. ATM enriched extract was bound 
to streptavidin iron oxide beads attached to ds DNA of 
various sizes (15, 25, 50 or 75 bp).. After extensive 
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washing, ATM was eluted by sequential washes with 100, 
250 and 500 tnM KCl . Eluates were analysed as in (A) . 
(C) ATM binds DNA containing a variety of different 
architectures. ATM enriched extract was bound to 
streptavidin iron oxide beads bound to either ss or ds 
DNA containing a nick, ds/ss transition, gap or 10 bp 
insertion. Washing, elution and ATM detection was as in 
(B) . 

Figure 2: Purification of ATM from HeLa cell nuclear 
extract. (A) ATM Purification strategy, HeLa nuclear 
extract was subjected to ion exchange chmmatography 
using Q-Sepharose and peak ATM fractions, eluting between 
160-200 mM KCl, were passed over heparin- agarose ion 
exchange resin. ATM fractions eluting from heparin- 
agarose between 200-220 mM KCl were pooled and subjected 
to DNA affinity purification and elution from DNA-bearing 
beads at 500 mM KCl resulting in an essentially 
homogeneous preparation of ATM. (B) Purification of ATM 
to essential homogeneity. Equivalent volumes (5 /il) of 
HeLa cell nuclear extract (50 /zg protein) , or pooled 
fractions following Q-sepharose, Heparin -agarose or DNA 
affinity chromatography were subjected to 7% SDS-PAGE and 
proteins visualised by silver staining (upper panel) . 
Fractions were also subjected to Western blot analysis 
(lower panel) using antibodies raised against ATM, DNA- 
PKcs/ Ku70 plus Ku80 or the 70kDa subunit of RPA, as 
indicated. 

Figure 3: Purified ATM possesses an associated p53 
kinase activity. (A) Analysis of putative ATM 
substrates. DNA-PKcs (60 ng) , Ku (100 ng) , Spl (100 ng) , 
p53 (100 ng), RPA-p34 (100 ng) or PCNA (100 ng) were used 
in kinase reactions in conjunction with approximately 11 
fmole of purified ATM (see Experimental Procedures) . 
Proteins were resolved on either 7% (left panel) or 10% 
(right panel) polyacrylamide gels' afid phosphorylated 
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proteins detected by autoradiography. (B) Analysis of 
total proteins itnmuno-precipitated from purified ATM 
= preparations . Purified ATM was biotinylated and 
subjected to immunoprecipitation using either pre-immune 
5 sera, or ATM antisera raised against amino acid residues 

1980-2337 (ATM.B) or the N-terminus (ATM.N) of ATM. 
Precipitated proteins were resolved on 7.8% 
polyacryl amide gels and, after transfer to 
nitrocellulose, total precipitated proteins were detected 

10 by probing filters with streptavidin- conjugated 

horseradish peroxidase. (C) Immunoprecipitated ATM. 
possesses p53 kinase activity. Purified-ATM was 
immunoprecipitated using pre-immune sera, or anti-ATM 
antisera ATM.B or ATM.N. Following immunoprecipitation, 

15 kinase reactions were performed either in the presence or 

absence of p53 as indicated. Phosphorylated proteins 
were resolved on 10% polyacryl amide gels and detected by 
autoradiography. 



20 



Figure 4: A DNA- stimulated protein kinase activity co- 
purifies with ATM. (A) ATM associated kinase activity is 
stimulated by linear DNA containing multiple p53 binding 
sites. Purified ATM, DNA-PK or cyclin A/cdk2 (11 fmole) , 
as indicated, were used in kinase reactions containing 
25 p53 either in the absence (-) or presence of 0.03, 0.3 or 

3 fmole of linear DNA bearing multiple p53 binding sites 
(pGijCAT) . Proteins were resolved on 10% polyacrylamide 
gels and phosphorylated proteins visualised by 
autoradiography. (B) ATM associated kinase activity does 
30 not require DNA ends. In vitro kinase reactions 

containing 11 fmole of purified ATM in conjunction with 
p53 were performed in either the absence (-) or presence 
of 0.03, 0.3 or 30 fmole of linear or supercoiled pG,jCAT 
DNA. Proteins were detected as in (A) . 



35 



Figure 5: ATM phosphorylates p53 at Serl5 and Thrl8 in 
the presence of DNA. Kinase reactions employing ATM and 
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p53 were performed in the presence and absence of DNA, 
= These studies revealed phosphorylation of p53 was 
■increased in the presence of dna. (a,b) Bands 
corresponding to »P-labelled p53 were excised from a gel, 
5 digested with trypsin, and chromatographed on a Vydac 

218TP54 CI 8 column (see Experimental Procedures) . 
Purified p53 fractions phosphorylated by ATM preparations 
in the presence, but not in the absence, of DNA (peptides 
2a, 2b and 2c) were subjected to peptide sequence 
10 analysis as described in Experimental Procedures; 

radioactivity was measured after each cycle of Edman 
degradation. The putative amino acid sequence of the p53 
. peptide showing incorporation of "P is indicated in panel 
C. (D) Tryptic peptide map of p53 phosphorylated by DNA- 
15 PK in the presence of DNA. Kinase reactions containing 

DNA-PK and p53 were performed in the presence of linear 
DNA and "P-labelled p53 was analysed as in (A.B), again 
revealing phosphorylation at Serl5 and ThrlS. 



20 



Figure 6a shows the amino acid sequence of human ATM, 
with the kinase domain marked by underlining. Figure 6b 
shows the ATM nucleic acid sequence with the initiation 
codon underlined. 



Figure 7a shows the amino acid sequence of human p53 with 
residues phosphorylated by ATM marked by underlining. 
Figure 7b shows the p53 nucleic acid sequence with the 
initiation codon underlined. 

Figure 8a shows the amino acid sequence of human ATR 
(FRP-1) . Figure 8b shows the ATR nucleic acid sequence 
with the initiation codon underlined. 

Figure 9a shows the amino acid sequence of DNA-PKcs. 
Figure 9b shows the DNA-PK nucleic acid sequence with the 
initiation codon underlined. 
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Figure 10 shows fractionation of two DNA activated kinase 
activities in HeLa nuclear cell extract capable of 
phosphorylating Serl5 of p53. Top panel; a Western 
itnmuno-blot was performed with antibodies that 
specifically recognise p53 phosphorylated on Serl5 on 
reactions in which fractions generated when HeLa cell 
nuclear extract was fractionated on Q-sepharbse were 
incubated with p53 and ATP in the presence of sonicated 
calf thymus DNA. Middle panel; the same set of fractions 
were tested for DNA-PKcs by using an anti-DNA-PKcs 
antiserum in western immuno-blot analysis. Lower panel: 
the same set of fractions were tested foi^ the presence of 
ATR by using an anti-ATR antiserum in western immuno-blot 
analysis. Additional studies revealed that both 
activities detected are stimulated by DNA. 

Figure 11 shows DNA activated kinase activity (activity 
1) CO- fractionates with ATR. Activity peak 1 was 
fractionated further on DNA-cellulose followed by 
chromatography on Heparin- agarose. Bottom panel; the 
final set of fractions was tested for p53 kinase activity 
via incubation with p53, ATP and DNA and then analysis by 
SDS-polyacrylamide gel electrophoresis and Western 
immuno-blotting using the p53 Serl5-specif ic antibodies. 
Top panel; a silver-stain of an SDS-polyacrylamide gel of 
the same set of fractions tested for p53 kinase activity. 
ATR is indicated with an arrow. 

The present invention in various aspects provides for 
modulating, interfering with or interrupting, increasing 
or potentiating interaction between the ATM protein and 
p53, particularly phosphorylation of p53 by ATM, using an 
appropriate agent. As noted, it having now been 
established for the first time that ATM is a protein 
kinase, it is highly likely to act on other molecules, 
particularly proteins including a site which is 
homologous to one of the sites in p53 phosphorylated by 
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ATM. The present invention further extends to. the use 
proteins having an associated kinase activity similar t 
ATM, especially DNA-PK and ATR. The present invention 
extends to modulation of such phosphorylation and this 
should be borne in mind when considering the disclosure 
herein which for convenience uses p53 for illustrative 
purposes, and as a preferred embodiment in certain 
contexts . 



An agent capable of modulating interaction between ATM 
and p53 may be capable of blocking interaction between 
site located within amino acid residues including SerlS 
or Thrl8.. 



In addition to interacting at the site of phosphorylation 
of p53, ATM and p53 may interact at one or more other 
sites within either or both proteins. Affecting 
interaction at such a site may have an effect on 
phosphorylation of p53 by ATM. Various fragments and 
derivatives of the proteins, particular of p53, may be 
used to analyse this, using techniques such as alanine 
scanning and deletion analysis. The present invention 
encompasses modulation of interaction between ATM and p53 
at any site, preferably resulting in modulation of p53 
phosphorylation by ATM. 

The full amino acid sequence of the ATM protein has been 
elucidated and is set out in Savitsky et ^1 1995a, 1995b, 
and Figure 6a, of which the amino acid residue numbering 
is used. The kinase domain is marked in Figure 6a. The 
p53 amino acid sequence is shown in Figure 7a, of which 
the amino acid residue numbering is used. These 
sequences are human sequences. ATM and p53 are conserved 
among vertebrates, particular mammals - see e.g. Figure 2 
of Soussi et al. For p53 conservation in the regions of 
the residues shown herein to be phosphorylated by ATM - 
so the present invention extends to xise in any of its 
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aspects of other vertebrate, particularly mammalian, p53 
and/or ATM, e.g. primate, such as monkey, rodent, such as 
mouse or rat, pig, horse, cow, sheep, goat, dog, cat, and 
so on. The amino acid and nucleic acid sequences of ATR 
5 (also known as FRPl) are set out in Cimpich et al, 1996. 

The amino acid sequence is reproduced as Figure 8a. The 
amino acid sequence of DNA-PK is provided in Hartley et 
al, 1995 and is set out in Figure 9a. The nucleic acid 
sequences of these proteins are also included as Figures 
10 6b, 7b, 8b and 9b. 

Agents useful in accordance with the present invention 
may be identified by screening techniques which involve 
determining whether an agent under test inhibits or 

15 disrupts the interaction of ATM protein or a suitable 

fragment thereof (e.g. including amino acid residues of 
the kinase domain, as marked on Figure 6, or a smaller 
fragment of any of these regions) of ATM, with p53 or a 
fragment thereof, or a suitable analogue, fragment or 

20 variant thereof. One class of preferred fragments of p53 
are those which include one or both of the 
phosphorylation sites at Serl5 or Thrl8. 

Suitable fragments of ATM or p53 include those which 
25 include residues which interact with the counterpart 

protein. Smaller fragments, and analogues and variants 
of this fragment may similarly be employed, e.g. as 
identified using techniques such as deletion analysis or 
alanine scanning. 

30 

Thus, the present invention provides a peptide fragment 
of ATM which is able to interact with p53 and/or inhibit 
interaction between ATM and p53, particularly 
phosphorylation of p53 by ATM, and provides a peptide 
35 fragment of p53 which is able to interact with ATM and/or 

inhibit interaction between p53 and ATM, particularly 
phosphorylation of p53 by ATM, such peptide fragments 
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being obtainable by means of deletion analysis and/or 
alanine scanning of the relevant protein - making an 
appropriate mutation in sequence, bringing together a 
mutated fragment of one of the proteins with the other or 
a fragment thereof and determining interaction, 
preferably phosphorylation of p53 or fragment thereof. 
In preferred embodiments, the peptide is short, as 
discussed below, and may be a minimal portion that is 
able to interact with the relevant covmterpart protein 
and/or inhibit the relevant interaction. 

Screening methods and assays are discussed in more detail 
below. 



One class of agents that can be used to disrupt the 
interaction of ATM and p53 are peptides based on the 
sequence motifs of ATM or p53 that interact with 
counterpart p53 or ATM (as discussed already above) . 
Such peptides tend to be short, and may be about 40 amino 
acids in length or less, preferably about 35 amino acids 
in length or less, more preferably about 30 amino acids 
in length, or less, more preferably about 25 amino acids 
or less, more preferably about 20 amino acids or less, 
more preferably about 15 amino acids or less, more 
preferably about 10 amino acids or less, or 9, 8, 7, 6, 5 
or less in length. The present invention also 
encompasses peptides which are sequence variants or 
derivatives of a wild type ATM or p53 sequence, but which 
retain ability to interact with p53 or ATM (respectively, 
as the case may be) and/or ability to modulate 
interaction between ATM and p53, particularly 
phosphorylation of p53 by ATM. 

Instead of using a wild-type ATM or p53 fragment, a 
peptide or polypeptide may include an amino acid sequence 
which differs by one or more amino acid residues from the 
wild- type amino acid sequence, by one or more of 
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addition, insertion, deletion and substitution of one or 
more amino acids. Thus, variants, derivatives, alleles, 
■ mutants and homologues, e.g. from other organisms, are 
included. 

Preferably, the amino acid sequence shares homology with 
a fragment of the relevant ATM or p53 fragment sequence 
shown preferably at least about 30%, or 40%, or 50%, or 
60%, or 70%, or 75%. or 80%, or 85%, 90% or 95% homology. 
Thus, a peptide fragment of ATM or p53 may include 1, 2, 
3, 4, 5, greater than 5, or greater than 10 amino acid 
alterations such as substitutions with respect to the 
wild-type sequence. Preferably, the peptide fragments of 
ATM are based on the sequence of all or part of the 
15 kinase domain as shovm in figure 6. Preferably, the p53 

fragments are based on the N- terminal sequence of the 
molecule around the sites phosphorylated by ATM, i.e 
comprising the amino acid motif PPLSQETFSD, or more 
generally, the motif SxxT, where x is any amino acid. 



10 



20 



A derivative of a peptide for which the specific sequence 
is disclosed herein may be in certain embodiments the 
same length or shorter than the specific peptide. In 
other embodiments the peptide sequence or a variant 

25 thereof may be included in a larger peptide, as discussed 

above, which may or may not include an additional portion 
of ATM or p53. 1, 2, 3, 4 or 5 or more additional amino 
acids, adjacent to the relevant specific peptide fragment 
in ATM or p53, or heterologous thereto may be included at 

30 one end or both ends of the peptide. 

(It should not be forgotten that references to ATM and 
p53 apply equally to ATM and related proteins such as ATR 
and DNA-PK and other proteins including a phosphorylation 
35 site homologous to one in p53 phosphorylated by ATM.) 

As is well -understood, homology af the amino acid level 
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is generally in terms of amino acid similarity- or 
■ : identity. Similarity allows for "conservative 

■variation", i.e. substitution of one hydrophobic residue 
•such as isoleucine, valine, leucine or methionine for 
5 another, or the substitution of one polar residue for 

another, such as arginine for lysine, glutamic for 
aspartic acid, or glutamine for asparagine. Similarity 
may be as defined and determined by the TBLASTN program, 
of Altschul et ai. (1990) J. Mol. Biol. 215: 403-10, 
10 which is in standard use in the art. Homology may be 

over the full-length of the relevant peptide or over a 
contiguous sequence of about 5, 10, 15, 20, 25, 30, 35, 
50, 75, 100 or more amino acids, compared with the 
relevant wild-type amino acid sequence. 



15 



20 



30 



As noted, variant peptide sequences and peptide and non- 
peptide analogues and mimetics may be employed, as 
discussed further below. 



Various aspects of the present invention provide a 
substance, which may be a single molecule or a 
composition including two or more components, which 
includes a peptide fragment of ATM or p53 which includes 
a sequence as recited in Figure 6 or Figure 7, 
25 particularly within the ATM kinase domain marked in 

Figure 6, a peptide consisting essentially of such a 
sequence, a peptide including a variant, derivative or 
analogue sequence, or a non-peptide analogue or mimetic 
which has the ability to interact with ATM or p53 and/or 
modulate, disrupt or interfere with interaction between 
ATM or p53. 



Variants include peptides in which individual amino acids 
can be substituted by other amino acids which are closely 
35 related as is understood in the art and indicated above. 

Non-peptide mimetics of peptides are" discussed further 
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below. 

As noted, a peptide according to the present invention 
and for use in various aspects of the present invention 
may include or consist essentially of a fragment of ATM 
or p53 as disclosed, such as a fragment whose sequence is 
shown in Figure 6 or Figure 7, respectively. Where one 
or more additional amino acids are included, such amino 
acids may be from ATM or p53 or may be heterologous or 
foreign to ATM or p53 . A peptide may also be included 
within a larger fusion protein, particularly where the 
peptide is fused to a non-ATM or p53 (i.e. heterologous 
or foreign) sequence, such as a polypeptide or protein 
domain. 

The invention also includes derivatives of the peptides, 
including the peptide linked to a coupling partner, e.g. 
an effector molecule, a label, a drug, a toxin and/or a 
carrier or transport molecule, and/or a targeting 
molecule such as an antibody or binding fragment thereof 
or other ligand. Techniques for coupling the peptides of 
the invention to both peptidyl and non-peptidyl coupling 
partners are well known in the art. In one embodiment, 
the carrier molecule is a 16 aa peptide sequence derived 
from the homeodomain of Antennapedia (e.g. as sold under 
the name "Penetratin'O , which can be coupled to a peptide 
via a terminal Cys residue. The ^^Penetratin" molecule 
and its properties are described in WO 91/18981. 

Peptides may be generated wholly or partly by chemical 
synthesis. The compounds of the present invention can be 
readily prepared according to well-established, standard 
liquid or, preferably, solid-phase peptide synthesis 
methods, general descriptions of which are broadly 
available (see, for example, in J.M. Stewart and J.D. 
Young, Solid Phase Peptide Synthesis, 2nd edition. Pierce 
Chemical Company, Rockford, Illinois- (1984), in M. 
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Bodanzsky and A. Bodanzsky, The Practice of Peptide 
Synthesis, Springer Verlag, New York (1984); and Applied 
Biosystems 430A Users Manual, ABI Inc., Foster City, 
California) , or they may be prepared in solution, by the 
liquid phase method or by any combination of solid-phase, 
liquid phase and solution chemistry, e.g. by first 
con^leting the respective peptide portion and then, if 
desired and appropriate, after removal of any protecting 
groups being present, by introduction of the residue X by 
reaction of the respective carbonic or sulfonic acid or a 
reactive derivative thereof. 



. Another convenient way of producing a peptidyl molecule 
according to the present invention (peptide or 
15 polypeptide) is to express nucleic acid encoding it, by. 

use of nucleic acid in an expression system. 

Accordingly the present invention also provides in 
.various aspects nucleic acid encoding the polypeptides 
20 and peptides of the invention. 

Generally, nucleic acid according to the present 
invention is provided as an isolate, in isolated and/or 
purified form, or free or substantially free of material 

25 with which it is naturally associated, such as free or 

substantially free of nucleic acid flanking the gene in 
the human genome, except possibly one or more regulatory, 
sequence (s) for expression. Nucleic acid may be wholly 
or partially synthetic and may include genomic DNA, cDNA 

30 or RNA. Where nucleic acid according to the invention 

includes RNA, reference to the sequence shown should be 
construed as reference to the RNA equivalent, with U 
substituted for T. 

35 Nucleic acid sequences encoding a polypeptide or peptide 

in accordance with the present invention can be readily 
prepared by. the skilled person using' the information and 
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10 



re.ferences contained herein and techniques known in the 

• art (for example, see Sambroojc. Fritsch and Maniatis, 

• "Molecular Cloning, A Laboratory Manual, Cold Spring' 
Harbor Laboratory Press. 1989, and Ausubel et al , Short 
Protocols in Molecular Biology, John Wiley and Sons, 
1992), given the nucleic acid sequence and clones 
available. These techniques include (i) the use of the 
polymerase chain reaction (PCR) to amplify samples of 
such nucleic acid, e.g. from genomic sources, (ii) 
chemical synthesis, or (iii) preparing cDNA sequences. 
DNA encoding ATM or p53 fragments may be generated and 
used in any suitable way known to those of skill in the 

. art, including by taking encoding DNA, identifying 
suitable restriction enzyme recognition sites either side 
15 of the portion to be. expressed, and cutting out said 

portion from the DNA. The portion may then be operably 
linked to a suitable promoter in a standard commercially 
available expression system. Another recombinant 
approach is to amplify the relevant portion of the DNA 
with suitable PCR primers. Modifications to the ATM or 
p53 sequences can be made, e.g. using site directed 
mutagenesis, to lead to the expression of modified ATM or 
p53 peptide or to take account of codon preference in the 
host cells used to e:q>ress the nucleic acid. 



20 



25 



30 



In order to obtain es^ression of the nucleic acid 
sequences, the sequences can be incorporated in a vector 
having one or more control sequences operably linked to 
the nucleic acid to control its expression. The vectors 
may include other sequences such as promoters or 
enhancers to drive the expression of the inserted nucleic 
acid, nucleic acid sequences so that the polypeptide or 
peptide is produced as a fusion and/or nucleic acid 
encoding secretion signals so that the polypeptide 
35 produced in the host cell is secreted from the cell. 

Polypeptide can then be obtained by transforming the 
vectors into host cells in which the "vector is 
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functional, culturing the host cells so that the 
polypeptide is produced and recovering the polypeptide 
from the host cells or the surrounding medium. 
Prokaryotic and eukaryotic cells are used for this 
purpose in the art, including strains of E. coli, yeast, 
and eukaryotic cells such as cos or CHO cells. 

Thus, the present invention also encompasses a method of 
making a polypeptide or peptide . (as disclosed) , the 
method including expression from nucleic acid encoding 
the polypeptide or peptide (generally nucleic acid 
according to the invention) . This may conveniently be 
achieved by growing a host cell in culture, containing 
such a vector, under appropriate conditions which cause 
or allow expression of the polypeptide. Polypeptides and 
peptides may also be expressed in in vitro systems, such 
as reticulocyte lysate. 

Systems for cloning and expression of a polypeptide in a 
variety of different host cells are well known. Suitable 
host cells include bacteria, eukaryotic cells such as 
mammalian and yeast, and baculovirus systems. Mammalian 
cell lines available in the art for expression of a 
heterologous polypeptide include Chinese hamster ovary 
cells, HeLa cells, baby hamster kidney cells, COS cells 
and many others. A common, preferred bacterial host is 
E. coli. 

Suitable vectors can be chosen or constructed, containing 
appropriate regulatory sequences, including promoter 
sequences, terminator fragments, polyadenylation 
sequences, enhancer sequences, marker genes and other 
sequences as appropriate. Vectors may be plasmids, viral 
e.g. 'phage, or phagemid, as appropriate. For further 
details see, for example, Molecular Cloning: a Laboratory 
Manual: 2nd edition, Sambrook et al., 1989, Cold Spring 
Harbor Laboratory Press. Many known' techniques and 
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protocols for manipulation of nucleic acid, for example 
in preparation of nucleic acid constructs, mutagenesis, 
sequencing, introduction of DNA into cells and gene 
expression, and analysis of proteins, are described in 
detail in Current Protocols in Molecular Biology, Ausubel 
et al. eds,, John Wiley & Sons, 1992. 

Thus, a further aspect of the present invention provides 
a host cell containing heterologous nucleic acid as 
disclosed herein. 

The nucleic acid of the invention may be-integrated into 
the genome (e.g. chromosome) of the host cell. 
Integration may be promoted by inclusion of sequences 
which promote recombination with the genome, in 
accordance with standard techniques. The nucleic acid 
may be on an extra -chromosomal vector within the cell, or 
otherwise identifiably heterologous or foreign to the 
cell. 

A still further aspect provides a method which includes 
introducing the nucleic acid into a host cell. The 
introduction, which may (particularly for in vitro 
introduction) be generally referred to without limitation 
as "transformation", may employ any available technique. 
For eukaryotic cells, suitable techniques may include 
calcium phosphate transfection, DEAE-Dextran, 
electroporation, liposome -mediated transfection and 
transduction using retrovirus or other virus, e.g. 
vaccinia or, for insect cells, baculovirus. For 
bacterial cells, suitable techniques may include calcium 
chloride transformation, electroporation and transfection 
using bacteriophage. As an alternative, direct injection 
of the nucleic acid could be employed. 



Marker genes such as antibiotic resistance or sensitivity 
genes may be used in identifying clones containing 
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nucleic acid of interest, as is well known in the art. 

•The introduction may be followed by causing or allowing 
•expression from the nucleic acid, e.g. by culturing host 
5 cells (which may include cells actually transformed 

although more likely the cells will be descendants of the 
transformed cells) under conditions for expression of the 
gene, so that the encoded polypeptide (or peptide) is 
produced. If the polypeptide is expressed coupled to an 

10 appropriate signal leader peptide it may be secreted from 

the cell into the culture medium. Following production 
by expression, a polypeptide or peptide may be isolated 
and/or purified from the host cell and/or culture medium, 
as the case may be, and svibsequently used as desired, 

15 e.g. in the formulation of a con^osition which may 

include one or more additional components, such as a 
pharmaceutical composition which includes one or more 
pharmaceutically acceptable excipients, vehicles or 
carriers (e.g. see below) . 



20 



25 



30 



Introduction of nucleic acid encoding a peptidyl molecule 
according to the present invention may take place in vivo 
by way of gene therapy, to disrupt or interfere with 
interaction between ATM or p53 



Thus, a host cell containing nucleic acid according to 
the present invention, e.g. as a result of introduction 
of the nucleic acid into the cell or into an ancestor of 
the cell and/or genetic alteration of the sequence 
endogenous to the cell or ancestor (which introduction or 
alteration may take place in vivo or ex vivo) , may be 
comprised (e.g. in the soma) within an organism which is 
an animal, particularly a mammal, which may be human or 
non-human, such as rabbit, guinea pig, rat, mouse or 
35 other rodent, cat, dog, pig, sheep, goat, cattle or 

horse, or which is a bird, such as a chicken. 
Genetically modified or transgenic animals or birds 
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comprising such a cell are also provided as further 
aspects of the present invention. 

This may have a therapeutic aim. (Gene therapy is 
discussed below). Also, the presence of a mutant, 
allele, derivative or variant sequence within cells of an 
organism, particularly when in place of a homologous 
endogenous sequence, may allow the organism to be used as 
a model in testing and/or studying substances which 
modulate activity of the encoded polypeptide in vitro or 
are otherwise indicated to be of therapeutic potential. 
Knock-out mice, for instance, may be. used- to test for 
radiosensitivity. Conveniently, however, at least 
preliminary assays for such substances may be carried out 
in vitro, that is within host cells or in cell -free 
systems. Where an effect of a test compound is 
established on cells in vitro, those cells or cells of 
the same or similar type may be grafted into an 
appropriate host animal for in vivo testing. 

For instance, p53 function or activity may be measured in 
an animal system such as a tumour model, e.g. involving a 
xenograft, relying on active p53 . The animal may be 
subject to radio- or chemo- therapy and a test substance 
administered. An augmentation of the reaction in the 
animal to the radio- or chemo -therapy may be indicative 
of blocking of ATM phosphorylation of p53 . 

Suitable screening methods are conventional in the art. 
They include techniques such as radioimmunosassay, 
scintillation proximetry assay and ELISA methods. 
Suitably either the ATM protein or fragment or p53 or 
fragment, or an analogue, derivative, variant or 
functional mimetic thereof, is immobilised whereupon the 
other is applied in the presence of the agents under 
test. In a scintillation proximetry assay, a 
biotinylated protein fragment may be- bound to 
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10 



15 



20 



30 



streptavidin coated scintillant - impregnated beads 
(produced by Amersham) . Binding of radiolabelled peptide 
is then measured by determination of radioactivity 
induced scintillation as the radioactive peptide binds to 
the immobilized fragment. Agents which intercept this 
are thus inhibitors of the interaction. Further ways and 
means of screening for agents which modulate interaction 
between ATM and p53 are discussed below. 

In one general aspect, the present invention provides an 
assay method for a substance with ability to modulate, 
e.g. disrupt or interfere with interaction between ATM 
and p53, the method including: 

(a) bringing into contact a substance according to 
the invention including a peptide fragment of ATM, or a 
protein having an associated kinase activity, or a 
derivative, variant or analogue thereof as disclosed, a 
substance including the relevant fragment of p53 or a 
variant, derivative or analogue thereof. 



A test compound which disrupts, reduces, interferes with 
or wholly or partially abolishes interaction between said 
substances (e.g. including a ATM fragment and including a 
p53 fragment) , and which may modulate ATM and/or p53 
25 activity, may thus be identified. 

Agents which increase or potentiate interaction between 
the two substances may be identified using conditions 
which, in the absence of a positively-testing agent, 
prevent the substances interacting. 



Another general aspect of the present invention provides 
an assay method for a substance able to interact with the 
relevant region of ATM or p53 as the case may be, the 
35 method including: 

(a) bringing into contact a substance which 
includes a peptide fragment of ATM or a protein having an 
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associated kinase activity which interacts with p53 as 
disclosed, or which includes a peptide fragment of p53 
which interacts with ATM or a protein having an 
associated kinase activity, or a variant, derivative or 
analogue of such peptide fragment, as disclosed, and a 
test conpound; and, 

(b) determining interaction between said substance 
and the test compound. 



10 A test compound found to interact with the relevant 

portion of ATM may be tested for ability to modulate, 
e.g. disrupt or interfere with, ATM interaction with p53 
and/or ability to affect p53 and/or ATM activity or other 
activity mediated by ATM or p53 as discussed already 

15 above . 

Similarly, a test compound found to interact with the 
relevant portion of p53 may be tested for abiliy to 
modulate, e.g. disrupt or interfere with, p53 interaction 
20 with ATM and/or ability to affect ATM and/or p53 activity 

or other activity mediated by p53 or ATM as discussed 
elsewhere herein. 

Another general aspect of the present invention provides 
25 an assay method for a substance able to affect p53 

activity, the method including: 

(a) bringing into contact pS3 and a test compound; 

and, 

(b) determining p53 activity. 

30 

p53 activity may be determined in the presence and 
absence of ATM to allow for an effect of a test compound 
on activity to be attributed to an effect on interaction 
between p53 and ATM, preferably phosphorylation of p53 by 
35 ATM (discussed further below) . 

p53 activities which may be determined include induction 
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• of expression of a protein such as p21 (WAFl) cellular 
: sensitivity to ionizing radiation, p53-induced apoptosis 
•activity, p53- induced anti -proliferative activity, p53- 
induced senescence of cells 

5 

In assaying for agents able to modulate phosphorylation 
of p53 by ATM, suitable fragments of p53 may be employed 
including any of the sites of such phosphorylation. 
Where it is desired to determine phosphorylation at the 

10 Sens and/or Thrl8 site, DNA will generally be included 

in the assay system to stimulate the requisite kinase 
activity of ATM. As noted, the present invention extends 
also to non-human p53 and phosphorylation at sites 
equivalent to those of human p53 identified herein. 

15 Thus, the assays may employ derivatives of full length 

p53 or the p53 fragments including the phosphorylation 
sites at Serl5 and/or Thrl8. 

The present invention further provides the use of DNA for 
20 stimulating phosphorylation of p53 by ATM, e.g. in an 

assay but also in many other contexts. Such 
phosphorylation may include at the Serl5 and/or Thrl8 
site of human p53 or equivalent site in p53 of another 
species, particularly of a vertebrate such as a mammal. 

25 

An assay according to the present invention may include 
an inhibitor of DNA-PKcs kinase activity, to avoid 
complications of redundant phosphorylation by that 
kinase. Such an inhibitor of DNA-PKcs kinase activity 
30 might not affect ATM kinase activity. 

Further assays according to the present invention are for 
agents which modulate DNA binding by ATM. Inhibitors 
and/or activators may be screened using appropriate 
35 conditions for determination of DNA binding by ATM. 

Thus, a further aspect of the present invention provides 
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an assay method for a compound able to affect DNA binding 
by ATM or a protein having an associated kinase activity, 
the method including: 

(a) bringing into contact a substance which is ATM 
5 or a protein having an associated kinase activity, or a 

fragment, variant or derivative thereof able to bind DNA, 
DNA and a test compound, under conditions wherein, in the 
absence of the test compound being an inhibitor of DNA 
binding by ATM or the protein having an associated kinase 
10 activity, said substance binds said DNA; and, 

(b) determining binding between said substance and 
said DNA. 

Activators of DNA binding by ATM may similarly be • 
.15 identified using an assay method wherein said substance, 

the DNA and the test compound are brought together under 
conditions wherein in the absence of the test compound 
being a potentiator of DNA binding by ATM, the substance 
does not bind the DNA. Activators include substances 
which activate ATM associated kinase activity in the 
absence of DNA or substances which enhance the 
interaction of ATM and p53, both of which may allow the 
induction of a p53 response in the absence of DNA damage, 
e.g. as caused by irradiation. 



20 



25 



30 



DNA binding may be determined using any suitable 
technique, including an electrophoretic mobility shift 
assay (EMSA) , UV protein-DNA crosslinking, chemical or 
DNasel footprinting, and so on. 



Determination of DNA binding by ATM may be performed in 
conjunction with determination of phosphorylation, 
sequentially or simultaneously. For instance a 
preliminary screen may identify molecules which modulate 
35 DNA binding by ATM and such substances may then be used 

in assays to determine their ability (or not) to modulate 
phosphorylation of p53 . The converse, in which ability 
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to modulate phosphorylation is determined prior to 
ability to modulate ATM DNA binding, is also possible, as 
is to run two assays in parallel. 

Preliminary assays in vitro may be followed by, or run in 
parallel with, in vivo assays. 

Of course, the person skilled in the art will design any 
appropriate control experiments with which to compare 
results obtained in test assays. 

Performance of an assay method according-.to the present 
invention may be followed by isolation and/or manufacture 
and/or use of a compound, substance or molecule which 
tests positive for ability to modulate interaction 
between ATM and p53 and/or inhibit ATM or p53 activity or 
a mediated activity. 

The precise format of an assay of the invention may be 
varied by those of skill in the art using routine skill 
and knowledge. For example, interaction between 
substances may be studied in vitro by labelling one with 
a detectable label and bringing it into contact with the 
other which has been immobilised on a solid support. 
Suitable detectable labels, especially for peptidyl 
substances include "S-methionine which may be 
incorporated into recombinant ly produced peptides and 
polypeptides. Recombinantly produced peptides and 
polypeptides may also be expressed as a fusion protein 
containing an epitope which can be labelled with an 
antibody. 

The protein which is immobilized on a solid support may 
be immobilized using an antibody against that protein 
bound to a solid support or via other technologies which 
are known per se. A preferred in vitro interaction may 
utilise a fusion protein including glutathione-S- 
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transferase (GST) . This may be immobilized on 
glutathione agarose beads, in an in vitro assay format 
. ^of the type described above a test compound can be 
:. -assayed by determining its ability to diminish the amount 
5 of labelled peptide or polypeptide which binds to the 

immobilized GST-fusion polypeptide. This may be 
determined by fractionating the glutathione-agarose beads 
by SDS-polyacrylaraide gel electrophoresis. 
Alternatively, the beads may be rinsed to remove unbound 
10 protein and the amount of protein which has bound can be 

determined by counting the amount of label present in, 
for example, a suitable scintillation counter. 

An assay according to the present invention may also take 
15 the form of an in vivo assay. The in vivo assay may be 

performed in a cell line such as a yeast strain or 
mammalian cell line in which the relevant polypeptides or 
peptides are expressed from one or more vectors 
introduced into the cell. 



20 



The ability of a test compound to modulate interaction 
between ATM and p53 may be determined using a so-called 
two-hybrid assay. 

25 For example, a polypeptide or peptide containing a 

fragment of ATM or p53 as the case may be, or a peptidyl 
analogue or variant thereof as disclosed, may be fused to 
a DNA binding domain such as that of the yeast 
transcription factor GAL4 . (a particularly preferred 

30 fragment of ATM may include or be the kinase domain or a 

fragment of the kinase domain.) The GAL4 transcription 
factor includes two functional domains. These domains 
are the DNA binding domain (GAL4DBD) and the GAL4 
transcriptional activation domain (GAL4TAD) . By fusing 

35 one polypeptide or peptide to one of those domains and 

another polypeptide or peptide to the respective 
counterpart, a functional GAL4 transoription factor is 
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restored only when two polypeptides or peptides of 
interest interact. Thus, interaction of the polypeptides 
or peptides may be measured by the use of a reporter gene 
probably linked to a GAL4 DNA binding site which is 
capable of activating transcription of said reporter 
gene. This assay format is described by Fields and Song, 
1989, Nature 340; 245-246. This type of assay format can 
be used in both mammalian cells and in yeast. Other 
combinations of DNA binding domain and transcriptional 
activation domain are available in the art and may be 
preferred, such as the LexA DNA binding domain and the 
VP60 transcriptional activation domain. 

When looking for peptides or other substances which 
interfere with interaction between a ATM polypeptide or 
peptide and p53 polypeptide or peptide, the ATM or p53 
polypeptide or peptide may be employed as a fusion with 
(e.g.) the LexA DNA binding domain, and the counterpart 
p53 or ATM polypeptide or peptide as a fusion with (e.g.) 
VP60, and involves a third expression cassette, which may 
be on a separate expression vector, from which a peptide 
or a library of peptides of diverse and/or random 
sequence may be expressed. A reduction in reporter gene 
expression (e.g. in the case of jS-galactosidase a 
weakening of the blue colour) results from the presence 
of a peptide which disrupts the ATM/p53 interaction, 
which interaction is required for transcriptional 
activation of the /3-galactosidase gene. Where a test 
substance is not peptidyl and may not be expressed from 
encoding nucleic acid within a said third expression 
cassette, a similar system may be employed with the test 
substance supplied exogenously. 

When performing a two hybrid assay to look for substances 
which interfere with the interaction between two 
polypeptides or peptides it may be preferred to use 
mammalian cells instead of yeast cells. The same 
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principles apply and appropriate methods are well known 
to those skilled in the art. 

In preferred assays according to the present invention, 
5 the end-point of the assay, that is to say that which is 

determined in order to assess the effect of the test 
agent on the interaction of interest, is phosphorylation 
of p53 or a fragment, variant or derivative thereof, or 
other molecule including a phosphorylation site 
10 homologous to one of those in p53 phosphorylated by ATM. 

Thus, a further aspect of the present invention provides 
an assay method including 

(a) bringing into contact a substance which 

.15 includes at least a fragment of ATM which phosphorylates 

p53, a substance which includes at least a fragment of 
p53 including a site phosphorylated by ATM, and a test 
compound; and, 

(b) determining phosphorylation at said site. 

20 Of course, any suitable variant or derivative of ATM 

and/or p53 may be employed in such an assay. 



Phosphorylation may be determined for example by 
immobilising p53 or a fragment, variant or derivative 
thereof, e.g. on a bead or plate, and detecting 
phosphorylation using an antibody or other binding 
molecule (such as Mdm2 or a fragment thereof) which binds 
the relevant site of phosphorylation with a different 
affinity when the site is phosphorylated from when the 
site is not phosphorylated. Such antibodies may be 
obtained by means of any standard technique as discussed 
elsewhere herein, e.g. using a phosphorylated peptide 
(such as a fragment of p53) . Binding of a binding 
molecule which discriminates between the phosphorylated 
and non-phosphorylated form of p53 or relevant fragment, 
variant or derivative thereof may be assessed using any 
technique available to those skilled- in the art, which 
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may involve determination of the presence of a- suitable 
: label, such as fluorescence. Phosphorylation may be 
•determined by immobilisation of p53 or a fragment, 
•variant or derivative thereof, on a suitable substrate 
5 such as a bead or plate, wherein the substrate is 

impregnated with scintillant, such as in a standard 
scintillation proximetry assay, with phosphorylation 
being determined via measurement of the incorporation of 
radioactive phosphate. Phosphate incorporation into p53 
10 or a fragment, variant or derivative thereof, may be 

determined by precipitation with acid, such as 
trichloroacetic acid, and collection of the precipitate 
on a nitrocellulose filter paper, followed by measurement 
of incorporation of radiolabeled phosphate. 

15 

An agent able to inhibit phosphorylation of p53 by ATM 
may include an ATP analogue or other substance able to 
affect the catalytic properties of the enzymically active 
site of ATM. An inhibitor of phosphorylation may 

20 interact with ATM within the kinase domain marked (for 

human ATM) in Figure 6. Residues within this domain are 
involved with interaction with p53 and catalysis of the 
phosphorylation. Residues outside of the domain may also 
be involved in interacting with p53 and agents which 

25 interfere with such interaction may affect the 

phosphorylation as discussed elsewhere herein. 

The amount of test substance or cpiipound which may be 
added to an assay of the invention will normally be 

30 determined by trial and error depending upon the type of 

compound used. Typically, from about 0.001 nM to ImM or 
more concentrations of putative inhibitor compound may be 
used, for example from 0.01 nM to 100/iM, e.g. 0.1 to 50 
MM. such as about 10 /xM. Greater concentrations may be 

35 used when a peptide is the test substance. Even a 

molecule which has a weak effect may be a useful lead 
compound for further investigation and development. 
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Compounds which may be used may be natural or synthetic 
chemical compounds used in drug screening programmes . 
Extracts of plants which contain several characterised or 
uncharacte.rised components may also be used. 

5 

Antibodies directed to the site of interaction in either 
protein form a further class of putative inhibitor 
compounds. Candidate inhibitor antibodies may be 
characterised and their binding regions determined to 
10 provide single chain antibodies and fragments thereof 

which are responsible for disrupting the interaction. 

Antibodies may be obtained using techniques which are 
' standard in the art. Methods of producing antibodies 
15 include immunising a. mammal (e.g. mouse, rat, rabbit, 

horse, goat, sheep or monkey) with the protein or a 
fragment thereof. Antibodies may be obtained from 
immunised animals using any of a variety of techniques 
known in the art, and screened, preferably using binding 
of antibody to antigen of interest. For instance. 
Western blotting techniques or immunoprecipitation may be 
used (Armitage et al., 1992, Nature 357: 80-82). 
Isolation of antibodies and/or antibody- producing cells 
from an animal may be accompanied by a step of 
25 sacrificing the animal. 

As an alternative or supplement to immunising a mammal 
with a peptide, an antibody specific for a protein may be 
obtained from a recombinantly produced library of 

30 expressed immunoglobulin variable domains, e.g. using 

lambda bacteriophage or filamentous bacteriophage which 
display functional immunoglobulin binding domains on 
their surfaces; for instance see W092/01047. The library 
may be naive, that is constructed from sequences obtained 

35 from an organism which has not been immunised with any of 

the proteins (or fragments) , or may be one constructed 
using sequences obtained from an organism which has been 



20 



BNSDOaO: <W0 9904266A2lL> 



wo 99/04266 



PCT/GB98/02115 



43 

exposed to the antigen of interest. 

Antibodies according to the present invention may be 
modified in a number of ways. Indeed the term "antibody" 
should be construed as covering any binding substance 
having a binding domain with the required specificity. 
Thus the invention covers antibody fragments, 
derivatives, functional equivalents and homologues of 
antibodies, including synthetic molecules and molecules 
whose shape mimicks that of an antibody enabling it to 
bind an antigen or epitope. 

Example antibody fragments, capable of binding an antigen 
or other binding partner are the Fab fragment consisting 
of the VL, VH, CI arid CHI domains; the Fd fragment 
consisting of the VH and CHI domains; the Fv fragment 
consisting of the VL and VH domains of a single arm of an 
antibody; the dAb fragment which consists of a VH domain; 
isolated CDR regions and F(ab')2 fragments, a bivalent 
fragment including two Fab fragments linked by a 
disulphide bridge at the hinge region. Single chain Fv 
fragments are also included. 

A hybridoma producing a monoclonal antibody according to 
the present invention may be subject to genetic mutation 
or other changes. It will further be understood by those 
skilled in the art that a monoclonal antibody can be 
subjected to the techniques of recombinant DNA technology 
to produce other antibodies or chimeric molecules which 
retain the specificity of the original antibody. Such 
techniques may involve introducing DNA encoding the 
immunoglobulin variable region, or the complementarity 
determining regions (CDRs) , of an antibody to the 
constant regions, or constant regions plus framework 
regions, of a different immunoglobulin. See, for 
instance, EP184187A, GB 2188638A or EP-A- 0239400 . 
Cloning and expression of chimeric antibodies are 
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described in EP-A-0120694 and EP-A-0125023 . 

^Hybridomas capable of producing antibody with desired 
binding characteristics are within the scope of the 
present invention, as are host cells, eukaryotic or 
prokaryotic, containing nucleic acid encoding antibodies 
(including antibody fragments) and capable of their 
expression. The invention also provides methods of 
production of the antibodies including growing a cell 
capable of producing the antibody under conditions in 
which the antibody is produced, and preferably secreted. 

The reactivities of antibodies on a sample may be 
determined by any appropriate means. Tagging with 
individual reporter molecules is one possibility. The 
reporter molecules may directly or indirectly generate 
detectable, and preferably measurable, signals. The 
linkage of reporter molecules may be directly or 
indirectly, covalently, e.g. via a peptide bond or non- 
covalently. Linkage via a peptide bond may be as a 
result of recombinant expression of a gene fusion 
encoding antibody and reporter molecule. The mode of 
determining binding is not a feature of the present 
invention and those skilled in the art are able to choose 
a suitable mode according to their preference and general 
knowledge. 

Antibodies may also be used in purifying and/or isolating 
a polypeptide or peptide according to the present 
invention, for instance following production of the 
polypeptide or peptide by expression from encoding 
nucleic acid therefor. T^tibodies may be useful in a 
therapeutic context (which may include prophylaxis) to 
disrupt the ATM/p53 (or ATR/p53) interaction with a view 
to inhibiting their activity. Antibodies can for 
instance be micro- injected into cells, e.g. at a tumour 
site, subject to. radio- and/or chemo- therapy (as 
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10 



discussed already above) . Antibodies may be employed in 
•accordance with the present invention for other 
therapeutic and non- therapeutic purposes which are 
discussed elsewhere herein. 

Other candidate inhibitor compounds may be based on 
modelling the 3 -dimensional structure of a polypeptide or 
peptide fragment and using rational drug design to 
provide potential inhibitor compounds with particular 
molecular shape, size and charge characteristics. 



A compound found to have the ability to rffect ATM and/or 
p53 activity has therapeutic and other potential in a 
number of contexts, as discussed. For therapeutic 

15 treatment such a compound may be used in combination with 

any other active substance, e.g. for ant i- tumour therapy 
another ant i- tumour compound or therapy, such as 
radiotherapy or chemotherapy. in such a case, the assay 
of the invention, when conducted in vivo, need not 

20 measure the degree of modulation of interaction between 

p53 and ATM (or appropriate fragment, variant or 
derivative thereof) or of modulation of p53 
phosphorylation or activity caused by the compound being 
tested. Instead the effect on DNA repair, homologous 

25 recombination, cell viability, cell killing (e.g. in the 

presence and absence of radio- and/or chemo- therapy) , 
retroviral integration, and so on, may be measured. It 
may be that such a modified assay is run in parallel with 
or subsequent to the main assay of the invention in order 

30 to confirm that any such effect is as a result of the 

inhibition of interaction between ATM and p53 caused by 
said inhibitor compound and not merely a general toxic 
effect. 

35 Thus, an agent identified using one or more primary 

screens (e.g. in a cell-free system) as having ability to 
interact with ATM and/or p53 and/of modulate activity of 
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ATM and/or p53 may be assessed further using one or more 
secondary screens. A secondary screen may involve 
testing for cellular radiosensitisation and/or 
sensitisation to radiomimetic drugs, effect on chromosome 
telomere length, inducing or preventing cell-cycle arrest 
following irradiation or other cellular insult, an effect 
of p53 induction following ionising radiation or other 
cellular insult, or induction of p21 or other downstream 
p53 target . 

Following identification of a substance or agent which 
modulates or affects ATM and/or p53 activity, the 
substance or agent may be investigated further. 
Furthermore, it may be manufactured and/or used in 
preparation, i.e. manufacture or formulation, of a 
composition such. as a medicament, pharmaceutical 
composition or drug. These may be administered to 
individuals, e.g. for any of the purposes discussed 
elsewhere herein . 



As noted, the agent may be peptidyl, e.g. a peptide which 
includes a sequence as recited above, or may be a 
functional analogue of such a peptide. 

As used herein, the expression "functional analogue" 
relates to peptide variants or organic compounds having 
the same functional activity as the peptide in question, 
which may interfere with the interaction between ATM and 
p53 . Examples of such analogues include chemical 
compounds which are modelled to resemble the three 
dimensional structure of the ATM or p53 domain in the 
contact area, and in particular the arrangement of the 
key amino acid residues as they appear in ATM or p53. 

In a further aspect, the present invention provides the 
use of the above substances in methods of designing or 
screening for mimetics of the substances. 
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Accordingly, the present invention provides a method of 

designing mimetics of ATM or p53 having the biological 
•activity of p53 or ATM binding or inhibition, the 
•activity of allosteric inhibition of p53 or ATM and/or 

the activity of modulating, e.g. inhibiting, ATM/p53 

interaction, said method comprising: 

(i) analysing a substance having the biological 
activity to determine the amino acid residues essential 
and inportant for the activity to define a pharmacophore; 
and, 

(ii) modelling the pharmacophore to design and/or 
screen candidate mimetics having the biological activity. 

Suitable modelling techniques are knovm in the art. This 
includes the design of so-called "mimetics" which 
involves the study of the functional interactions 
fluorogenic oligonucleotide the molecules and the design 
of compounds which contain functional groups arranged in 
such a manner that they could reproduced those 
interactions . 

The designing of mimetics to a known pharmaceutical ly 
active compound is a known approach to the development of 
pharmaceuticals based on a "lead" compound. This might 
be desirable where the active compound is difficult or 
expensive to synthesise or where it is unsuitable for a 
particular method of administration, e.g. peptides are 
not well suited as active agents for oral compositions as 
they tend to be quickly degraded by proteases in the 
alimentary canal. Mimetic design, synthesis and testing 
may be used to avoid randomly screening large number of 
molecules for a target property. 

There are several steps commonly taken in the design of a 
mimetic from a coitpound having a given target property. 
Firstly, the particular parts of the compound that are 
critical and/or important in determining the target 
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10 



property are determined. in the case of a peptide, this 
can be done by systematically varying the amino acid 
residues in the peptide, e.g. by substituting each 
residue in turn. These parts or residues constituting 
the active region of the compound are known as its 
"pharmacophore" . 

Once the pharmacophore has been found, its structure is 
modelled to according its physical properties, e.g. 
stereochemistry, bonding, size and/or charge, using data 
from a range of sources, e.g. spectroscopic techniques. 
X-ray diffraction data and NMR. Computafedonal analysis, 
similarity mapping (which models the charge and/or volume 
of a pharmacophore, rather than the bonding between 
15 atoms) and other techniques can be used in this modelling 

process . 

In a variant of this approach, the three-dimensional 
structure of the ligand and its binding partner are 
20 modelled. This can be especially useful where the ligand 

and/or binding partner change conformation on binding, 
allowing the model to take account of this the design of 
the mimetic. 

25 A template molecule is then selected onto which chemical 

groups which mimic the pharmacophore can be grafted. The 
template molecule and the chemical groups grafted on to 
it can conveniently be selected so that the mimetic is 
easy to synthesise, is likely to be pharmacologically 

30 acceptable, and does not degrade in vivo, while retaining 

the biological activity of the lead compound. The 
mimetic or mimetics found by this approach can then be 
screened to see whether they have the target property, or 
to what extent they exhibit it. Further optimisation or 

35 modification can then be carried out to arrive at one or 

more final mimetics for in vivo or clinical testing. 
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The mimetic or mimetics found by this approach- can then 
be screened to see whether they have the target property, 
or to what extent they exhibit it. Further optimisation' 
or modification can then be carried out to arrive at one 
5 or more final mimetics for in vivo or clinical testing. 

Mimetics of this type together with their use in therapy 
form a further aspect of the invention. 

10 The present invention further provides the use of a 

peptide which includes a sequence as disclosed, or a 
derivative, active portion, analogue, variant or mimetic, 
thereof able to interact with ATM or p53 and/or modulate, 
e.g. inhibit, interaction between ATM and p53 and/or 

15 modulate, e.g inhibit, ATM and/or p53 activity, in 

screening for a substance able to interact with p53 
and/or ATM, and/or modulate, e.g. inhibit, interaction 
between ATM and p53, and/or inhibit ATM and/ or p53 
activity. 

20 

Generally, such a substance, e.g. inhibitor, according to 
the present invention is provided in an isolated and/or 
purified form, i.e. substantially pure. This may include 
being in a composition where it represents at least about 

25 90% active ingredient, more preferably at least about 

95%, more preferably at least about 98%. Such a 
composition may, however, include inert carrier materials 
or other pharmaceutical ly and physiologicaly acceptable 
excipients. As noted below, a composition according to 

30 the present invention may include in addition to an 

inhibitor compound as disclosed, one or more other 
molecules of therapeutic use, such as an anti -tumour 
agent . 



35 



The present invention extends in various aspects not only 
to a substance identified as a modulator of ATM and p53 
interaction and/or ATM or p53 -mediated activity, property 
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or pathway, in accordance with what is disclosed herein, ' 
but also a pharmaceutical composition, medicament, drug' 
• or other composition comprising such a substance, a 

method comprising administration of such a composition to 
5 a patient, e.g. for a purpose discussed elsewhere herein, 

which may include preventative treatment, use of such a 
substance in manufacture of a composition for 
administration, e.g. for a purpose discussed elsewhere 
herein, and a method of making a pharmaceutical 
10 composition comprising admixing such a substance with a 

pharmaceutically acceptable excipient, vehicle or 
carrier, and optionally other ingredients-. 

A substance according to the present invention such as an 
15 inhibitor of ATM and p53 interaction may be provided for 

use in a method of treatment of the human or animal body 
by therapy which affects an ATM or p53 -mediated activity 
in cells, e.g. tumour cells, other purposes of a method 
of treatment employing a substance in accordance with the 
present invention are dicussed elsewhere herein. 



20 



25 



30 



Thus, the invention further provides a method of 
modulating an ATM and/or p53-mediated activity, e.g. for 
a purpose discussed elsewhere herein, which includes 
administering an agent which modulates, inhibits or 
blocks the interaction of ATM with p53 protein, such a 
method being useful in treatment where such modulation, 
inhibition or blocking is desirable, or an agent which 
increase, potentiates or strengthens interaction of ATM 
with p53, useful in treatment where this is desirable. 



The invention further provides a method of treatment 
which includes administering to a patient an agent which 
interferes with the interaction of ATM with p53 . 
35 Exemplary purposes of such treatment are discussed 

elsewhere herein. 
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Whether it is a polypeptide, antibody, peptide, nucleic 
: acid molecule, small molecule, mimetic or other 
•pharmaceutically useful compound according to the present 
invention that is to be given to an individual, 
5 administration is preferably in a ^^prophylactically 

effective amount" or a "therapeutically effective amount" 
(as the case may be, although prophylaxis may be 
considered therapy) , this being sufficient to show 
benefit to the individual. The actual amount 
10 administered, and rate and time-course of administration, 

will depend on the nature and severity of what is being 
treated. Prescription of treatment, e.g^ decisions on 
. dosage etc, is within the responsibility of general 
practioners and other medical doctors. 



15 



20 



A composition may be administered alone or in combination 
with other treatments, either simultaneously or 
sequentially dependent upon the condition to be treated. 



Pharmaceutical compositions according to the present 
invention, and for use in accordance with the present 
invention, may include, in addition to active ingredient, 
a pharmaceutically acceptable excipient, carrier, buffer, 
stabiliser or other materials well known to those skilled 
25 in the art. Such materials should be non-toxic and 

should not interfere with the efficacy of the active 
ingredient. The precise nature of the carrier or other 
material will depend on the route of administration, 
which may be oral, or by injection, e.g. cutaneous, 
30 subcutaneous or intravenous. 

Pharmaceutical compositions for oral administration may 
be in tablet, capsule, powder or liquid form. A tablet 
may include a solid carrier such as gelatin or an 
35 adjuvant. Liquid pharmaceutical compositions generally 

include a liquid carrier such as water, petroleum, animal 
or vegetable oils, mineral oil or synthetic oil. 
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Physiological saline solution, dextrose or other 
saccharide solution or glycols such as ethylene glycol, 
propylene glycol or polyethylene glycol may be included. 

5 For intravenous, cutaneous or subcutaneous injection, or 

injection at the site of affliction, the active 
ingredient will be in the form of a parenterally 
acceptable aqueous solution which is pyrogen- free and has 
suitable pH, isotonicity and stability. Those of 

10 relevant skill in the art are well able to prepare 

suitable solutions using, for example, isotonic vehicles 
such as Sodium Chloride Injection, Ringex^' s Injection, 
Lactated Ringer's Injection. Preservatives, stabilisers, 
buffers, antioxidants and/or other additives may be 

15 included, as required. 

Liposomes, particularly cationic liposomes, may be used 
in carrier formulations. 

20 Examples of techniques and protocols mentioned above can 

be found in Remington's Pharmaceutical Sciences, 16th 
edition, Osol, A. (ed) , 1980. 

The agent may be administered in a localised manner to a 
25 tumour site or other desired site or may be delivered in 

a manner in which it targets tumour or other cells. 

Targeting therapies may be used to deliver the active 
agent more specifically to certain types of cell, by the 

30 use of targeting systems such as antibody or cell 

^specific ligands. Targeting may be desirable for a 
variety of reasons, for example if the agent is 
unacceptably toxic, or if it would otherwise require too 
high a dosage, or if it would not otherwise be able to 

35 enter the target cells. 

Instead of administering these agent-s directly, they may 
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be produced in the target cells by expression from an 
encoding gene introduced into the cells, eg in a viral 
vector (a variant of the VDEPT technique - see below) . 
The vector may targeted to the specific cells to be 
treated, or it may contain regulatory elements which are 
switched on more or less selectively by the target cells. 

The agent (e.g. small molecule, mimetic) may be 
administered in a precursor form, for conversion to the 
active form by an activating agent produced in, or 
targeted to, the cells to be treated. This type of 
approach is sometimes known as ADEPT or VDEPT, the former 
involving targeting the activator to the cells by 
conjugation to a cell-specific antibody, while the latter 
15 involves producing the activator, e.g. an enzyme, in a 

vector by expression from encoding DNA in a viral vector 
(see for example, EP-A-415731 and WO 90/07936) . 



10 



20 



An agent may be administered in a form which is inactive 
but which is converted to an active form in the body. 
For instance, the agent may be phosphorylated (e.g. to 
improve solubility) with the phosphate being cleaved to 
provide an active form of the agent in the body, 

25 A composition may be administered alone or in combination 

with other treatments, either simultaneously or 
sequentially dependent upon the condition to be treated, 
such as cancer, virus infection or any other condition in 
which a ATM or p53 -mediated effect is desirable. 

30 

Nucleic acid according to the present invention, encoding 
a polypeptide or peptide able to modulate, e.g. interfere 
with, ATM and p53 interaction and/or induce or modulate 
activity or other ATM or p53-mediated cellular pathway or 
35 function, may be used in methods of gene therapy, for 

instance in treatment of individuals, e.g. with the aim 
of preventing or curing (wholly or partially) a disorder 
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or for another purpose as discussed elsewhere herein. 

* Vectors such as viral vectors have been used in the prior 
art to introduce nucleic acid into a wide variety of 
different target cells. Typically the vectors are 
exposed to the target cells so that transfection can take 
place in a sufficient proportion of the cells to provide 
a useful therapeutic or prophylactic effect from the 
expression of the desired polypeptide. The transfected 
nucleic acid may be permanently incorporated . into the 
genome of each of the targeted cells, providing long 
lasting effect, or alternatively the treatment may have 
to be repeated periodically. 

A variety of vectors, both viral vectors and plasmid 
vectors, are known in the art, see US Patent No. 
5,252,479 and WO 93/07282. In particular, a number of 
viruses have been used as gene transfer vectors, 
including papovaviruses, such as SV40, vaccinia virus, 
herpesviruses, including HSV and EBV, and retroviruses. 
Many gene therapy protocols in the prior art have used 
disabled murine retroviruses. 

As an alternative to the use of viral vectors other known 
methods of introducing nucleic acid into cells includes 
electroporation, calcium phosphate co-precipitation, 
mechanical techniques such as microinjection, transfer 
mediated by liposomes and direct DNA uptake and receptor- 
mediated DNA transfer. 

Receptor-mediated gene transfer, in which the nucleic 
acid is linked to a protein ligand via polylysine, with 
the ligand being specific for a receptor present on the 
surface of the target cells, is an example of a technique 
for specifically targeting nucleic acid to particular 
cells. 
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A polypeptide, peptide or other substance able. to modlate 
or interfere with the interaction of the relevant 
polypeptide, peptide or other substance as disclosed 
herein, or a nucleic acid molecule encoding a peptidyl 
5 such molecule, may be provided in a kit, e.g. sealed in a 

suitable container which protects its contents from the 
external environment. Such a kit may include 
instructions for use. 

10 In further aspects the present invention provides for the 

provision of purified ATM and purified ATR. Purified ATM 
or ATR, for instance about 10% pure, more- preferably 
about 20% pure, more preferably about 30% pure, more 
preferably about 40% pure, more preferably about 50% 

15 pure, more preferably about 60% pure, more preferably 

about 70% pure, more preferably about 80% pure, more 
preferably about 90% pure, more preferably about 95% 
pure, or substantially pure ATM or ATR is obtainable 
using DNA. Such DNA may be in any form which ATM or ATR - 

20 bind, including single- stranded DNA, double- stranded DNA, 

nicked DNA, covalently closed DNA circles and so on. It 
is surprising that any and all of these are bound by ATM 
as shown experimentally below. 

25 In one aspect the present invention provides the use of 

DNA for purifying ATM or ATR. 

In another aspect the present invention provides a method 
of purifying ATM or ATR, the method including contacting 

30 ATM or ATR with DNA. A mixture of material including ATM 

or ATR may be contacted against immobilised DNA (e.g. on 
a bead or agarose, and either covalently or npn- 
covalently such as via a specific binding molecule such 
as streptavidin or biotin) and molecules which do not 

35 bind washed off. 

We have also established that ATM and ATR may be purified 
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using NTA, preferably in the presence of Ni^^ The NTA ma 
be on any suitable support such as agarose or sepharose. 
Thus, a further aspect of the present invention provides 
• the use of NTA, preferably with Ni^*, for purifying ATM o 
5 ' ATR. 



Another aspect of the present invention provides a method 
of purifying ATM or ATR which includes, contacting ATM or 
ATR with NTA, preferably with Ni^* and washing off 
10 molecules which do not bind. 

Purification using DNA may be combined with purification 
using NTA, preferably with Ni^% sequentially or 
simultaneously. 

15 

Either technique may be used for identification of co- 
factors of ATM which modulate ATM activity, such as 
factors which affect the interaction between ATM and DNA. 

20 The ATM contacted by DNA and/or NTA in a purification may 

be in a mixture of molecules, such as a cellular extract, 
such as from a cell of an A-T patient, a normal cell of 
an organism such as a human or a recombinant host cell 
expressing the protein from encoding DNA, such as a 

25 bacterial, eukaryotic (e.g. mammalian or yeast) or insect 

cell, such as in a baculovirus expression system. 
Purification may follow production of ATM recombinantly 
in a suitable expression system, such as a cell, by 
expression from encoding nucleic acid. 

30 

Following purification, ATM may be used as desired, e.g. 
in an assay for an agent which modulates its 
phosphorylation of p53 or other molecule, in raising or 
obtaining a specific antibody or other binding molecule, 
35 or in a therapeutic context such as to compensate in an 

individual for the absence of wild-type ATM (as in, for 
-example, a patient with A-T) . - . 
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Various further aspects and embodiments of the. present 
invention will be apparent to those skilled in the art in 
view of the present disclosure. Certain aspects and 
embodiments of the invention will now be illustrated by 
way of example and with reference to the figures 
discussed already above. 

ATM binds to DNA 

A biotinylated random ds 50-mer oligonucleotide was 
coupled to streptavidin iron-oxide particles and these 
were employed to recover DNA binding proteins from HeLa 
cell nuclear extracts. This approach revealed that ATM 
interacts with particles bearing this random piece of ds 
DNA (Figure lA) . This binding is due to the presence of 
DNA, since streptavidin iron-oxide particles alone are 
unable to bind ATM (Figure lA) . Importantly, the 
sequence specific DNA-binding protein Spl and the non- 
specific DNA interacting protein complex containing RNA 
polymerase II (Pol II) are both unable to interact stably 
with the random DNA fragment employed in these studies 
(Figure lA) . Furthermore, DNA-PKcs present in the crude 
nuclear extract binds only very inefficiently to the 
immobilised DNA despite the fact that its DNA-targeting 
component Ku is present (data not shown). Notably, 
protein quantification reveals that, under conditions in 
which over 90% of ATM binds to the DNA-coupled particles, 
less than 2% of total nuclear protein is retained. 
Hence, the retention of ATM by DNA in these studies is 
highly specific. 

The above assay revealed that ATM, or an ATM complex, is 
capable of binding to a random piece of duplex DNA. 
Additional studies revealed that ATM is also retained by 
particles containing another unrelated oligonucleotide, 
suggesting strongly that the interaction is not sequence- 
specific (data not shown) . To investigate the DNA 
binding properties of ATM further ^ we tested a series of 



10 



wo 99/04266 PCT/GB98/021 15 

58 

DNAs with a variety of sizes and architectures. In these 
studies, binding and initial washes were conducted in the 
presence of 50 tnM KCl, then bound material was eluted by 
sequential washes at 100, 250 and 500 nM KCl. Figure IB 
demonstrates that the interaction between DNA and ATM is 
dependent on the size of the DNA-duplex. Thus, with a ds 
15-mer, some ATM is still present in the unbound fraction 
and most bound material elutes in the lower salt wash. 
However, as the duplex size is increased, it becomes 
progressively more effective at binding ATM, such that 
when ds oligonucleotides of 50 bp or larger are employed, 
binding of ATM is almost quantitative and all bound ATM 
elutes in the higher salt wash (Figure IB) , 

15 Since a variety of DNA structures are known to be 

produced by IR and are present during DNA- repair 
processes, we assessed the ability of ATM to bind to 
various types of DNA structure. Thus, assays were 
conducted employing particles coupled to a ds 100 -mer 
oligonucleotide bearing a nick, a single-strand to 
double-strand transition, a gap of 35 bp, or a 10 base 
insertion loop. Notably, under the assay conditions 
employed, ATM binds to these DNA molecules with equal 
efficiency and apparent affinity as it does to the fully 
25 ds DNA oligonucleotide (Figure IC) . Additional studies 

show that ATM also binds effectively to ss DNA (Figure 
IC) and that, as with ds-DNA, this binding is dependent 
on oligonucleotide length (data not shown) . Furthermore, 
ATM binding in such experiments is competed effectively 
by linear and circular plasmid DNA, suggesting that DNA 
termini are not required for ATM binding (NDL, 
unpublished data) . Taken together, these data show that 
ATM, or a conplex containing this factor, is capable of 
interacting with DNA molecules containing a variety of 
different structures in an apparently non- sequence 
specific fashion. Our results also show that ATM prefers 
to bind to linear DNA, preferentially binding to the ends 
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of the DNA. 
Purification of ATM 

To increase our understanding of ATM further, we decided 
to attempt to purify this protein to essential 
homogeneity and thus separate it from other DNA-binding 
proteins, DNA repair factors, and protein and lipid 
kinases. The purification strategy we developed is 
outlined in Figure 2A. Since ATM is expressed 
ubiquitously and is located primarily in the cell 
nucleus, HeLa cell nuclear extract was used as starting 
material. Because no biochemical assay was available for 
ATM protein function, we monitored its purification by 
Western blot analysis using antibodies raised against two 
different portions of the protein (Lakin et al., 1996). 
This approach not only revealed the fractionation of ATM 
but also allowed us to pool fractions that were devoid of 
the abundant DNA-PK enzyme through simultaneously testing 
for the presence of DNA-PKcs.and Ku. In light of the DNA- 
binding properties of ATM, we employed a final DNA 
affinity step in the purification scheme (Figure 2B, lane 
4) . Silver staining demonstrates that this leads to an 
essentially homogenous preparation of a -350 kDa 
polypeptide, and Western blotting studies reveal that 
this is recognised strongly by ATM antiserum ATM ,3 
(Figure 2B) . Since this protein is also recognised by 
two other antibodies raised to distinct regions of the 
ATM polypeptide (data not shown) , we conclude that the 
purified protein is indeed ATM. As revealed in Figure 
2B, whilst ATM is enriched throughout the purification 
procedure, Ku, DNA-PKcs. and the abundant ss DNA binding 
protein Replication Protein A (RPA) are all efficiently 
removed. Quantitative Western blotting and silver- 
staining reveal that the final yield of ATM is 
approximately 25% and indicate that ATM is of relatively 
low abundance, comprising around 0.002% of total nuclear 
protein by weight , ■ - 
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10 



Purified ATM possesses an associated p53 kinase activity 
Notably, as for DNA-PK (Hartley et al., 1995), purified 
ATM preparations were fovind to be devoid of detectable 
kinase activity towards PI and a variety of 
phosphorylated PI derivatives. Although we cannot 
exclude that ATM phosphorylates these or related 
phospholipids under certain conditions or in the presence 
of additional components, we conclude that ATM is not a 
lipid kinase. To assay for possible ATM-associated 
protein kinase activity, we performed in vitro kinase 
assays using equivalent amounts of various recombinant or 
purified proteins that we speculated may-be ATM 
substrates. Certain candidate substrates, such as DNA- 
PKcs, Ku, proliferating cellular nuclear antigen (PCNA) , 
15 and the 34 kDa subunit of RPA (RPA-p34) , were chosen by 

virtue of their association with DNA damage detection 
and/or involvement in DNA repair. We also tested Spl and 
p53, since these are both good substrates for DNA-PK and 
because A-T cells display aberrant induction of p53 in 
20 response to IR. A final protein tested was I,B, since 

recent data have implicated this is an ATM target (Jung 
et al., 1995; Jung et al., 1997). Given that we had 
found that ATM binds to DNA, we included a DNA 
oligonucleotide known to activate DNA-PK in all initial 
25 kinase reactions. 

Notably, none of DNA-PK^s, RPA-p45 and PCNA was 
phosphorylated efficiently by purified ATM (Figure 3A) . 
However, longer exposures of autoradiograms reveals weak 

30 phosphorylation of both the 70 kDa subunit of Ku (Ku70) 

and Spl by ATM preparations (data not shown) . 
Furthermore, prolonged exposures also reveal that ATM is 
capable of autophosphorylation (data not shown), 
consistent with previous rough studies employing ATM that 

35 had been immunoprecipitated directly from crude cell 

extracts (Keegan et al., 1996) (likely to contain all 
sorts of impurities) . Most significantly, however, 
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several independently purified ATM preparations were 
• .consistently found to phosphorylate p53 with high 
.' .efficiency (Figure 3A) (contrary to the mentioned results 
of Keegan et al.) . Taken together, these data reveal 
5 that, under our assay conditions, a protein kinase 

activity co-purifies with ATM that phosphorylates p53 
efficiently, and Spl and Ku70 weakly. Iitportantly, DNA- 
PK efficiently phosphorylates p53, Spl, Ku70 and RPA-p34 
in vitro, revealing that the ATM-associated kinase 
10 activity exhibits a different substrate specificity from 

that of DNA-PK. This, together with the absence of 
detectable DNA-PKcs or Ku in our ATM preparations argues 
. strongly against the possibility that the ATM-associated 
protein kinase activity is imparted by DNA-PK 
15 contamination. 

Although the above results reveal that a p53 kinase 
activity co-purifies with ATM, prolonged silver staining 
reveals additional polypeptides in our ATM preparations 

20 (data now shown) . The possibility therefore existed that 

the p53 kinase activity that we had detected was not 
mediated by ATM but by a contaminating protein. To 
address this issue, we immunoprecipitated ATM from 
purified ATM preparations using polyclonal antibodies 

25 raised against either the N-terminal region (ATM.N) or an 

internal region (ATM.B) of the ATM polypeptide (Lakin et 
al., 1996). After washing the immunoprecipitated 
material extensively in the presence of 500 mM KCl and 
0.1% Nonidet-P40, it was employed in kinase reactions 

30 using p53 as substrate. To establish the purity of the 

immunoprecipitated material, purified ATM was 
biotinylated and immunoprecipitated in parallel with ATM 
employed in the kinase reactions. The biotinylated 
precipitated proteins were then visualised by Western 

35 transfer and probing with streptavidin conjugated 

horseradish peroxidase. 
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As illustrated in Figure 3B. a biotinylated protein of 
approximately 350 kDa in size, the predicted molecular 
mass of ATM, is precipitated in these studies by ant i -ATM 
antisera but not by pre-immune sera. Notably, no other 
proteins are consistently precipitated by both ATM 
antisera in these assays (a polypeptide of -100 kDa is 
apparent in the ATM.N precipitation in Figure 3B but is 
not present in ATM.B immunoprecipitates and was not 
consistently observed in subsequent experiments using 
ATM.N) . 

Most importantly, these experiments revealed that p53 
kinase activity is immunoprecipitated by the two ATM 
antisera. Greater ATM associated kinase activity is 
observed with ATM.N than with ATM.B, despite only 
slightly higher amounts of ATM being precipitated by 
ATM.N (Figure 3C) . One possible explanation for this is 
that AIW.B, which recognises epitopes close to the ATM 
kinase domain, impairs ATM protein kinase activity. 
These studies show that the p53 kinase activity present 
in our ATM preparations follows ATM through a further 
highly stringent immuno-af f inity purification step, and 
suggest strongly that ATM directly mediates p53 
phosphoylation. Although unlikely in our opinion, it 
remains a possibility that p53 is phosphorylated by a 
distinct polypeptide that has escaped our detection 
methods and which remains associated with ATM throughout 
the stringent purification and immunoprecipitation 
protocols employed. 

Am associated kinase activity is stimulated by DNA 
Given that ATM can interact with DNA, we investigated 
whether ATM associated protein kinase activity is 
stimulated by a nucleic acid cof actor. To achieve this, 
we performed in vitro kinase assays using purified ATM 
either in the absence or presence of increasing amounts 
of DNA. Because previous studies have revealed that co- 
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localisation of DNA-PK and Spl to the same DNA. molecule 
. increases phosphorylation efficiency (Lees-Miller et al., 
.1992; Gottlieb and Jackson, 1993), we employed a linear 
plasmid molecule bearing multiple p53 binding sites - 
5 These studies revealed that DNA addition leads to marked 

stimulation of p53 phosphorylation by DNA-PK (Figure 4A, 
middle) . Strikingly DNA addition was also found to 
result in marked stimulation of p53 phosphorylation in 
reactions containing ATM (Figure 4A, top) , Thus, 

10 purified ATM preparations contain a DNA-stimulatable p53 

kinase activity. Longer exposures of autoradiograms 
reveal that the ATM polypeptide is also subject to 
phosphorylation in such assays and that this 
phosphorylation is stimulated by DNA (data not shown) . 

15 Experiments employing equimolar amounts of DNA-PK and ATM 

revealed that the stimulation of p53 kinase activity by 
DNA is similar for ATM and DNA-PK, and that the 
stoichiometjry of p53 phosphorylation by ATM is at least 
as high as that catalysed by DNA-PK (data not shown) . 

20 Although DNA-dependent kinase activity was consistently 

observed in ATM preparations, the degree of activation 
was variable. In this regard, additional polypeptides 
were apparent in several preparations that displayed high 
levels of DNA activatability. Thus, it is possible that 

25 co-purifying polypeptides may be involved in high level 

ATM DNA dependent kinase activity. Notably, DNA-PK and 
ATM preparations both displayed significant but low 
levels of p53 kinase activity in the absence of DNA. It 
is not currently known, however, whether this reflects 

30 bona fide DNA- independent phosphorylation or results from 

small amounts of DNA in the protein preparations. 
Parallel experiments using cyclin A/cdk2 demonstrate no 
increase of p53 phosphorylation upon DNA addition (Figure 
4A) , and a variety of other protein kinases that we have 

35 tested are not stimulated by DNA. These results 

therefore show that increased protein phosphorylation is 
not a general effect of adding DNA- to p53 kinase assays 
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and reveal that ATM is highly unusual in its ability to 
be stimulated by DNA. 

We had established that ATM binds to various types of 
linear DNA molecule (see Figure 1) . Our binding 
competition studies indicated that ATM also interacts 
with supercoiled and nicked DNA (data not shown) . We 
tested whether ATM associated kinase activity is affected 
differentially by various DNA structures. p53 kinase 
assays were performed in the absence of DNA or in the 
presence of increasing amounts of either supercoiled or 
restriction enzyme -linearised plasmid DNA. 
Notably, ATM is activated by supercoiled and linear DNA 
(Figure 4B) , and additional studies revealed that good 
15 activation also occurs with nicked plasmid DNA molecules 

(data not shown). By contrast, DNA-PK is stimulated 
strongly by linear but only weakly by supercoiled plasmid 
DNA (Figure 4B; based on previous studies, the weak 
activation by the latter probably reflects small amounts 
20 of nicked and/or linear DNA in the supercoiled plasmid 

preparation) . These results are therefore consistent 
with data showing that ATM is able to interact with many 
different types of DNA structure. Furthermore, they show 
that, although ATM is analogous to DNA-PK in that its 
25 associated kinase activity is stimulated by DNA, the DNA 

cof actor requirements of the two enzymes are different. 

ATM associated kinase activity phosphorylates p53 at two 
si tes 

30 To determine the site(s) of p53 that are phosphorylated 

by ATM, bacterially expressed p53 was radioactively 
phosphorylated by ATM in either the presence or absence 
of DNA- Labelled p53 was purified by electrophoresis, 
digested by trypsin, and the resulting products separated 

35 by reverse-phase HPLC. Analysis of the resulting 

radioactive profiles showed a major peak eluting at 11- 
12% acetonitrile. A novel set of radioactive p53 derived 
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HPLC polypeptide peaks, which elute at 28-29% 
acetonitrile were induced substantially in the presence 
of DNA. Phosphoamino acid analysis revealed that the DNA 
induced peaks contained peptides labelled at both serine 
and threonine residues, suggesting either two distinctly 
labelled co-eluting peptides, or a single peptide 
containing both phosphoserine and phosphothreonine 
residues (data not shown) . Radioactive peaks with 
similar elution properties were identified following 
phosphorylation of p53 by DNA-PK (Figure 5B) or casein 
kinase I (data not shown) . Previous studies have 
revealed that both DNA-PK and casein kinase I 
phosphorylate the N-terminal region of p53 (Lees-Miller 
et al., 1992; Milne et al . , 1992). Initial attempts to 
15 sequence p53-derived peaks were unsuccessful, presumably 

because they possess blocked amino-termini. However, 
cleavage with endoproteinase Asp-N allowed sequencing of 
each. Notably, release of counts at cycles 9 and 12 of 
Edman degradation of peptide 2a reveals that the sites of 
phosphorylation correspond to p53 residues Ser-15 and 
Thr-18. Ser-15 has previously been demonstrated to be a 
phosphorylation site for DNA-PK (Lees-Miller et al., 
1992). However, no detectable DNA-PK exists in our ATM 
preparations (see above) . 



20 



25 



We therefore conclude that a novel DNA dependent kinase 
activity is associated with ATM that targets Ser-15 and 
Thr-18 of p53. 



DNA-PK ATR has an associated kinase activity that 
phosphorylates p53 at SerlS and Thrl8 

Given the fact than an activity in our ATM preparations 
was found to phosphorylate residue Thrl8 pf p53, we 
decided to test whether DNA-PK is also able to 
phosphorylate this site. To this end, p53 was incubated 
in the presence of radiolabelled [a"P] ATP with purified 
human DNA-PK (a preparation consisting of the Ku and DNA- 
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PKcs components of the enzyme; prepared as described in 
Hartley et al., 1995) in either the absence or presence 
'of a linearised plasmid DNA molecule, then, as described 
for analysis of ATM-mediated phosphorylation events, the 
p53 was treated with protease to generate phospho- 
peptides and these were analysed by reverse-phase HPLC. 
These studies revealed that, as in the ATM studies, a set 
of related peptides eluting at around 28-29% acetonitrile 
(CO- fractionating with ATM-derived peptides, 2a, b, and 
c; compare figures 5B and D) were phosphorylated by a 
DNA-PK associated kinase activity in a DNA- inducible 
fashion. Furthermore, analysis of these -revealed that 
they correspond to p53 peptides containing 
phosphorylation on residues Serl5 and Thrl9 (Figure 5B) . 
Subsequent studies using antibodies that recognise 
specifically p53 that is phosphorylated on Serl5 or Thrl9 
(see below for details of antibody preparation) confirmed 
that the DNA-PK-associated }cinase activity phosphorylates 
both of these residues of p53 . Therefore, contrary to 
expectations, DNA-PK-associated )cinase activity 
phosphorylates p53 on Thrl8 as well as SerlS. 

ATR has an associated kinase activity that phosphorylates 
p53 at SerlS 

Given that both DNA-PK-associated and a ATM- associated 
kinase activities phosphorylate p53 on SerlS and ThrlB, 
we decided to see whether other kinases exist that can 
target these residues. To facilitate this approach, we 
generated rabbit polyclonal antibodies that specifically 
recognise p53 that is phosphorylated on ThrlS (they do 
not recognise unphosphorylated p53 nor p53 that is 
phosphorylated solely on SerlB nor is phosphorylated 
elsewhere) . Similarly, we generated rabbit polyclonal 
antibodies that specifically recognised p53 
phosphorylated on SerlS. These antibodies were generated 
by immunising rabbits with specific p53 -based phospho- 
peptides (containing either ThrlS or. SerlS 
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phosphorylated) , then preparing the antibodies with the 
desired recognition characteristics (those that 
recognised the specific phosphorylated peptides but not 
unphosphorylated versions of these peptides) by 
chromatography on columns bearing immobilised 
unphosphorylated peptide and columns bearing specific 
phosphorylated peptides. 

To assess kinases activated in human cell extracts 
capable of phosphorylating p53 on SerlS, HeLa nuclear 
extract was fractionated chromatographically (see below) 
then the resulting fractions were incubated with full- 
length p53 protein and non-radioactively labelled ATP, 
either in the absence or presence of DNA. 
Phosphorylation of p53 was then assessed by subjecting 
the samples to SDS-polyacrylamide gel electrophoresis and 
Western immunoblotting. As shown in Figure 10, two main 
peaks of kinase activity (termed "activity 1" and 
"activity 2") capable to targeting Serl5 (S15) were 
detected in fractions of HeLa nuclear extract that had 
been chromatographed on Q-sepharose. Further analysis of 
these fractions revealed that both activities were 
stimulated by DNA. Furthermore, Western blotting 
revealed that fractions comprising "activity 1" contained 
the ATM- related protein ATR, whereas those comprising 
"activity 2" contained DNA-PKcs (Figure 10) . In 
addition, other experiments revealed a third weaker, 
activity peak in fractions between those comprising 
activity 1 and activity 2, which corresponded to ATM. 
Further purification of activity peak 2 revealed that it 
corresponded to DNA-PK. Further fractionation of 
activity 1 revealed that, under all chromatographic 
separation techniques utilised, the DNA-activated p53 
Serl5 kinase activity co-eluted with ATR. Indeed, 
through following this kinase activity, ATR could be 
purified to near homogeneity (e.g. Figure 11; ATR was the 
only polypeptide whose elution was found to consistently 
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parallel that of the kinase activity). Thus, in addition 
to DNA-PK and ATM targeting p53 Serl5, we have made the 
surprising discovery that this residue is also 
phosphorylated by a kinase activity associated with ATR. 

5 

Effect of p53 phosphorylation on interaction with Mdim2 
To test whether phosphorylation on SerlS or ThrlS of p53 
affects its interaction with MdTn-2, phosphorylated and 
unphosphorylated p53 -derived peptides were generated and 
10 were assessed for Mdm-2 binding by ELISA analysis. The 

four peptides used contained p53 residues 11 to 25 (in 
the sequence NH2 - SGSG EPPLSOETFfinT.WTa- rr>nH . where the 
underlined sequence is that derived from p53) that were 
unphosphorylated (1); phosphorylated on residue 
15 equivalent to p53 residue Serl5(2); phosphorylated on 

residue equivalent to p53 residue ThrlS (3) ; or 
phosphorylated on two residues, equivalent to p53 reside 
SerlS and ThrlS (4). Binding of Mdm-2 derivatives 
occurred effectively with unphosphorylated peptide i but 
was found to be inhibited dramatically in the cases of 
peptides 3 and 4, which contained phosphorylated Thrl8 . 
In contrast, binding was only impaired slightly by 
phosphorylation on Serl5 (peptide 2) . We therefore 
conclude that phosphorylation on ThrlS of p53 has a 
25 dramatic effect on its interaction with Mdm-2 and that 

phosphorylation of this site is likely to play a key role 
in regulating p53 responses in vivo. 



20 



30 



Additional purification method for ATM 
HeLa nuclear extract was applied to Ni=* - nta agarose 
(Qiagen) . We found that ATM binds very tightly to this 
matrix, but not very well to Ni^' - IDA matrices. 

5 ml of nuclear extract was loaded onto a 1 x 2.5 cm 
35 column of Ni'* - NTA agarose in the following buffer 

(Buffer D; 25 mM HEPES-KOH, pH 7.6, 100 mM KCl, 10% 
Glycerol, r mM MgCl^, 20 mM imidazole). The column was 
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washed extensively (10 column volumes) before applying a 
linear gradient of 20 mM - 500 mM imidazole in buffer D. 
Virtually pure ATM (as judged by silver stain analysis of 
8% polyacrylamide gels) eluted near the end of the 
imidazole gradient. Less pure fractions of ATM eluted at 
the start of the gradient. 

This provides a purification strategy for ATM or ATR that 
may be used alone, or in combination with various other 
chromatographic steps, e.g. DNA affinity chromatography 
as discussed already above. 

Discussion 

We have demonstrated that ATM is retained on immobilised 
particles bearing DNA molecules. Notably, ATM binds to 
both ds and ss DNA in vitro, and studies employing a 
variety of unrelated oligonucleotides provide indication 
that this interaction is not sequence dependent. By 
exploiting these and other biochemical properties of ATM, 
we have developed a strategy to purify this polypeptide 
from HeLa nuclear extracts to near homogeneity. The high 
purity of our final ATM preparations and the fact that 
ATM in such preparations can re -bind to DNA provides 
indication that ATM interacts with DNA directly. Although 
this appears somewhat different from the situation with 
DNA-PKcs, which requires Ku to associate stably with DNA 
under our assay conditions, UV protein-DNA cross-linking 
has revealed that, in the context of the DNA-PKcs/Ku 
holoenzyme, DNA-PKcs does make close contacts with DNA 
(Gottlieb and Jackson, 1993) . DNA-PKcs and ATM may 
interact with DNA through similar mechanisms. 

Because the C- terminal region of ATM possesses homology 
to the catalytic domain of mammalian PI 3 -kinase, it has 
been speculated that ATM may phosphorylate inositol 
phospholipids. However, despite conducting lipid 
phosphorylation assays under various conditions and with 
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a. variety of potential substrates, no ATM-associated 
lipid kinase activity was detected in our ATM 
preparations. These data are thus consistent with recent 
studies demonstrating that ATM-containing 
5 immunoprecipitates possess no detectable lipid kinase 

activity (Jung et al., 1997). Although we cannot discount 
the possibility that ATM modifies particular PI 
derivatives under certain conditions or in association 
with additional cof actors, we tentatively conclude that, 
10 as has been proposed for DNA-PKcs (Hartley et al., 1995) 

and FRAP (Brown et ai . , 1995), ATM is not a lipid kinase. 

In contrast, our purified ATM preparations consistently 
possess protein serine/ threonine kinase activity. 

15 

Recently (Keegan et al . , 1996) have performed rough 
experiments which might suggest that ATM-containing 
immunoprecipitates phosphorylate an -350 kDa polypeptide, 
suggesting that ATM can modify itself (though the 
20 preparations would have contained all sorts of 

impurities, including kinases). We observe that purified 
ATM preparations are capable of some degree of ATM 
auto-phosphorylat ion . 



25 



30 



35 



In addition, we have tested ATM for its ability to modify 
a variety of other polypeptides. Notably, despite the 
fact that IkB has been implicated as an ATM target by in 
vivo functional studies (Jung et al., 1995) and has 
recently been reported to be phosphorylated by 
ATM- containing immunoprecipitates (Jung et al., 1997), 
under our assay conditions we do not detect significant 
IkB phosphorylation by ATM. Although alternatives exist, 
one explanation for this discrepancy is that IkB 
phosphorylation detected in the studies of (Jung et al . , 
1997) was mediated by a co-immunoprecipitating factor 
that is separated from ATM during our purification 
scheme. ... 
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Another protein that has been implicated as a possible 
ATM target by virtue of defective regulation in A-T cells 
is RPA (Liu and Weaver, 1993; Cheng et al . , 1996). 
However, we have been unable to detect significant 
phosphorylation of RPA by ATM, suggesting that ATM 
regulates RPA indirectly, in contrast to the above, we 
observe low but detectable phosphorylation of Spl and the 
70 kDa subunit of Ku by ATM. Although the significance of 
these phosphorylation events is uncertain, these findings 
raise the interesting possibilities that ATM plays a role 
in regulating Spl -dependent transcription and controlling 
the activity of the Ku/DNA-PKcs holoenzyme. 

By far the most efficient substrate for ATM that we have 
15 identified, however,, is p53 . Importantly, the p53 kinase 

activity we have detected consistently co-purifies with 
ATM, elutes from the final DNA affinity purification step 
with the same profile as the ATM polypeptide itself, and 
further co-purifies with ATM through an additional 
20 stringent immunoprecipitation procedure. These data 

provide strong indication that p53 kinase activity is an 
inherent property of the ATM polypeptide. 

In a manner strikingly reminiscent of the activation of 
25 DNA-PK by DNA strand breaks and ds to ss DNA transitions, 

we find that ATM and ATR associated p53 kinase activity 
is stimulated markedly by the addition of a DNA cof actor. 
There are several reasons why this DNA- stimulated protein 
kinase activity is unlikely to be mediated by 
contaminating DNA-PK. First, titration studies reveal 
that, to provide the observed level of p53 
phosphorylation, the DNA-PKcs content of ATM and ATR 
preparations would have to be essentially as great as 
that of ATM itself. Clearly, this is not the case - 
35 silver staining and Western blotting reveal that, if any 

residual DNA-PK does exist in our most purified ATM and 
ATR preparations, it is present at" levels undetectable by 
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the methods employed in this study. Second, the substrate 
specificity observed in ATM and ATR preparations is 
distinct from that of DNA-PK. Third, whereas ATM and 
ATR-associated kinase activity is stimulated similarly by 
supercoiled and linear plasmid molecules, DNA-PK is only 
activated strongly by the latter. 

There are several possible ways in which ATM and ATR 
might be stimulated by DNA, and each of these may 
contribute to the effects that we observe. (The mechanism 
of action provides no limitation to the nature and scope 
of the present invention-) 

One possibility is that DNA binding by ATM and ATR 
activates the catalytic potential of the proteins 
directly. Another is that the co- localisation of ATM and 
ATR and its target DNA binding protein on the same DNA 
molecule serves to potentiate interactions between the 
kinase and its target. In line with one or both of the 
above models, we have observed that ATM 

auto-phosphorylation is also enhanced by DNA, albeit to a 
lesser degree than that observed with p53 . 

Alternatively, at least part of the dramatic stimulation 
of p53 phosphorylation upon DNA addition could be 
explained by the binding of p53 to DNA inducing a 
conformational change in p53 that makes it a more 
effective ATM or ATR substrate. Thus, Ser-15 and Thr-18 
might only become accessible to ATM after p53 is bound to 
DNA. In accordance with such a model, it is known that 
the conformation of p53 does change upon binding to DNA 
(Halazonetis et al., 1993), and it has been observed that 
several naturally occurring p53 mutants that are 
defective in sequence-specific DNA binding exhibit 
reduced phosphorylation at Ser-15 (Ullrich et al., 1993). 

Given the DNA-PK paradigm, and because of the previously 
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described role of ATM in DNA damage signalling, it might 
^ be tempting to speculate that ATM or ATR protein kinase 
.activity in vivo is triggered by specific types of DNA 
damage or stalled DNA replication forks that occur in 
response to IR. However, unlike DNA-PK, which is 
activated strongly in vitro only by DNA molecules bearing 
perturbations in the DNA double-helix, we find that ATM 
interacts with all types of DNA structure that we have 
tested. It is, therefore, possible that ATM is active 
constitutively in mammalian cells. An alternative model, 
which we currently favour, is that ATM and ATR associate 
with other polypeptides rather like DNA-PXcs interacts 
with Ku, and it is the function of these additional 
components to restrict ATM or ATR activity under normal 
circumstances and only allow their activation after 
exposure to DNA damaging agents. In this regard, it is 
interesting to note that yeast genetic data indicate the 
S. cerevisiae and S. powhe homologues of ATM or ATR 
function in conjunction with other polypeptides in DNA 
damage signalling (reviewed in Elledge, 1996; Carr, 
1997) , and that biochemical studies reveal that ATM 
exists as a large complex of -2 MDa in crude cell 
extracts (GCMS, unpublished data) . 

Together with genetic data indicating that ATM functions 
upstream of p53 in a pathway for signalling IR- induced 
DNA damage, our findings provide indication that, 
following genomic insult, ATM and ATR phosphorylate p53 
directly. Such a model would help to explain the 
deficient up-regulation of p53 in response to IR in A-T 
cells and this, in turn, would explain at least some of 
the cell cycle checkpoint control defects of A-T cells. 
Interestingly, recent studies indicate that ATM interacts 
with p53 directly (Watters et al., 1997) providing a 
possible mechanism for optimising the efficiency of 
ATM-mediated p53 phosphorylation in the cellular context. 
Indeed, since p53 itself binds to DNA strand breaks and 
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DNA insertion loops (Balkalkin et al., 1994; Lee et ai , , 
1995; Reed et al., 1995), p53 could actually play a role 
in targeting ATM or ATR to sites of DNA damage. Such a 
model is particularly attractive when one considers that 
Ser-15 and Thr-18 reside in conserved and functionally 
important regions of the p53 polypeptide. Moreover, 
Ser-15 of p53 has been shown to be phosphorylated in vivo 
(reviewed in Anderson and Lees-Miller, 1992; Steegenga et 
al., 1996) . In addition, although Thr-18 has not yet been 
identified as a physiological site for p53 modification, 
it is noteworthy that this residue is highly conserved in 
p53, and that around 8% of p53 phosphorylation in vivo 
occurs at Thr residues (Samad et al., 1986). In light of 
these points, it will clearly be of great interest to 
analyse the phosphorylation status of p53 Ser-15 and 
Thr-18 in wild-type and A-T cells, and to determine their 
degree of phosphorylation in response to IR. 

Interestingly, phosphorylation of the N-terminal region 
of p53 has been proposed to effect both the stability and 
the transcriptional activation potential of p53 (reviewed 
in Ko and Prives, 1996; Steegenga et al., 1996). Indeed, 
mutation of Ser-15 impairs the capacity of p53 to prevent 
S-phase progression and affects p53 stability (Fiscella 
et al., 1993). Furthermore, p53 mutants unable to 
activate transcription show reduced phosphorylation at 
this site (Ullrich et al., 1993). Although no experiments 
have investigated the role of Thr-18 in p53 function 
directly, it is noteworthy that this residue forms part 
of the minimal p53 binding site for Mdm2, which functions 
as a negative regulator of p53 function (Oliner et al., 
1993). Significantly, Mdm-2 binding has been linked both 
to repressing p53 -dependent transcriptional activation 
and targeting p53 for degradation within the cell (Momand 
et al., 1992; Oliner et al., 1993; Kubbutat et al., 
1997) . An attractive scenario, therefore, is that 
phosphorylation of p53 by ATM or ATR. may inhibit Mdm2 
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interaction, thus both stabilising p53 and de-repressing 
its transcriptional activity. Consistent with this, we 
find that the binding of p53 -derived peptides to Mdm2 is 
strongly inhibited by phosphorylation of ThrlS. 

5 

It is enphasized that suggested mechanisms of action and 
models for ATM and p53 function discussed above are 
presented without limitation to the nature and scope of 
the present invention. 

10 

EXPERIMENTAL PROCEDURES 



DNA interaction studies 

Oligonucleotides: one DNA strand containing a 5' biotin 

15 group (indicated by a "B" below) was annealed with 

complementary oligonucleotide (s) and bound to 
streptavidin-coated iron-oxide particles (Dynabeads; 
Dynal, Oslo, Norway). HeLa nuclear extract, or ATM 
enriched extract (Q-Sepharose pool; see below) was 

20 incubated on ice for 30 min. with the DNA- iron oxide 

particles. After washing with 5 x 0.5 ml of D* Buffer 
(25 mM HEPES-KOH, pH 7.6, 20% glycerol, 2 mM MgCl2, 0.2 
mM EDTA, 1 mM DTT, 0.5 mM PMSF, ImM Na Metabisulf ite) 
containing 50 mM KCl, protein was eluted with 500 mM KCl 

25 D* buffer or in gradual stepwise manner with KCl 

concentrations of 100 mM, 250 mM and 500 mM in buffer D* . 
Fractions were analysed for ATM protein content by 
Western blotting using a previously described rabbit 
polyclonal antisera raised against amino acid residues 

30 1980-2337 of ATM (Lakin et al., 1996). 

Oligonucleotides: 

ds 15-mer: 5' B-CCTGCCCTTGCCTGA-3' 

5 ' TCAGGCAAGGGCAGG - 3 ' 

35 ds 25-mer 5' B-CCTGCCCTTGCCTGACGCTATTAGT-3 ' 

5 ' ACTAATAGCGTCAGGCAAGGGCAGG- 3 ' 

ds 50-mer 
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5 ' B-TTGTAAAACGACGGCCAGTGAATTCATCATCAATAATATACCTTATTTTG- 
3' 

' 5' CAAAATAAGGTATATTATTGATGATGAATTCACTGGCCGTCGTTTTACAA-3' 
' ds 75-mer 

5' 

BGATCGAATCCGATAGAGTATAGATAGAGTAAAGTTTAAATACTTATATAGATAG 
AGTATAGATAGAGGGTTCAAA-3 ' 

5 ' TTTGAACCCTCTATCTATACTCTATCTATATAAGTATTTAAACTTTACTC 
TATCTATACTCTATCGGATTCGATC-3 ' 
ss 50-iner 

5 ' B - TTGTAAAACGACGGCCAGTGAATTCATCATCAATAATATACCTTATTTTG - 
3' 

For the following, a biotinylated lOO-mer oligonucleotide 
(DYNO) was used as a "backbone" to which other 
oligonucleotides were annealed. 

DYNO 5' B- 

CCTGCCCTTGCCTGACGCTATTAGTTCATCTATTTGTTTTGCTAATTCGA 
TTGGAATCGAAACGGTO^CATATTCTTTTTTCACTGATTTCCTCGGCATA- 3 ' 

nicked oligo, DYNO + DAM2 + DAM3 : ds/ss transition, DYNO 
+ DAM3; gapped ds oligo, DYNO + DAM3 + DAMS; 10 bp 
insertion, DYNO + DAM6. 

DAM2 : 

5 ' TATGCCGAGGAAATCAGTCAAAAAAGAATATGTGACCGTTTCGATTCCAA- 3 ' 
DAM3: 

5 ' TCGAATTAGCAAAACAAATAGATGAACTAATAGCGTCAGGCAAGGGCAGG - 3 ' 
DAM5: 5' TATGCCGAGGAAATC-3 ' 

DAM6: 
5' 

TATGCCGAGGAAATCAGTCAAAAAAGAATATGTGACCGTTTCGAATTAGCAAT^C 
AAATAGATGAACTAATAGCGTCAGGCAAGGGCAGG-3 ' 
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ATM purification 

All steps were performed at 4'C. HeLa nuclear extract 
. (20 ml) was applied to a Q-Sepharose column (35 ml, 1.5 x 
20 cm) equilibrated in D* buffer (25 mM HEPES-KOH, pH 
7.6, 20% glycerol, 2 mM MgClj, 0.2 mM EDTA, 1 mM DTT, 0.5 
mM PMSF, 1 mM Na Metabisulf ite) containing 50 mM KCl . 
After washing with 2 column vol. of 50 mM KCl D*, protein 
was eluted with a continuous salt gradient of 50 mM - 500 
mM KCl in D* buffer. ATM eluted between 160 and 200 mM 
KCl. Fractions containing ATM and devoid of DNA-PK (as 
judged by Western blot analysis) were pooled and, after 
diluting to 100 mM KCl in D* buffer, were, loaded onto a 
heparin agarose column (1.5 x 6 cm) pre -equilibrated in 
100 mM KCl D* buffer. The column was washed with 2 
column vol. of 100 mM KCl D* buffer before eluting with a 
continuous gradient of 50 mM - 500 mM KCl in buffer D* . 
ATM was again followed by Western blot analysis and 
eluted between 200 and 220 mM KCl. Peak fractions were 
pooled and dialysed against 50 mM buffer D* . Peak ATM 
fractions were then incubated with gentle mixing for 1 h. 
with 200 fig biotinylated 50 bp ds DNA conjugated to 
streptavidin iron-oxide particles. Unbound protein was 
rebound to fresh DNA-iron oxide particles. Particles 
were collected via a magnet and were washed 5 x with 0.5 
ml of 50 mM KCl D* buffer before eluting ATM with 2 x 75 
/il 500 mM KCl buffer D* . Purified ATM was snap-frozen 
and stored at -70 'C. 

ATR purification 

ATR purification was carried out as set out in the 
description. 

Immunological Methods 

Western immunoblot analysis was performed as previously 
(Lakin et al., 1996). Spl antisera were purchased from 
Serotec Ltd. (Oxford, UK) . RPA-p70 and RNA polymerase II 
antisera were also utilised. PhospKo-specif ic antisera 
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were generated as described herein. 

Immunoprecipi tat ions were performed by incubating 
biotinylated or untreated purified ATM in parallel with 
serum for 1 h. on ice in D* buffer containing 50 mM KCl . 
Protein A Sepharose was added and the reaction incubated 
with slow rotation for a further h. at 4*C. Beads were 
washed at high stringency seven times in 500 /il of D* 
buffer containing 500 mM KCl and 0.1% NP-40. 
Biotinylated immunoprecipitated proteins were visualised 
by 7% SDS-PAGE followed by Western blotting and probing 
with streptavidin- conjugated horse-radish peroxidase. 
Un-biotinylated immunoprecipitated proteins were washed a 
further two times in 500 /xl i x Z' buffer prior to 
addition to kinase reactions (see below) . 

Phosphorylation assays 

Kinase reactions were performed in 20 /il containing: 10 
Ml Z' buffer (25 mM HEPES-KOH pH 7.9, 50 mM KCl, 10 mM 
MgClj, 20% glycerol, 0.1% NP-40, ImM DTT) ; 11 fmol ATM, 
DNA-PK or cyclin A/cdk2; 50-100 ng substrate and 0-30 
fmol of DNA. Reactions were assembled and incubated for 
3 min. on ice prior to addition of 10 /zCi [7-^'Pl ATP and 
incubation at 30 'C for 15 min. Phosphorylated proteins 
were subjected to 7% SDS-PAGE and visualised by 
autoradiography. 

Mapping of p53 phosphorylation sites 

Recombinant p53 (10-20 pmol; purified as previously (Hupp 
et al., 1992)) was incubated with 12-24 ng of purified 
ATM or DNA-PK in the presence of 100 fiM ATP containing 
10*-10' cpm/nmol t"P] -7ATP under reaction conditions 
described above. Linearised (pGij-CAT) or supercoiled 
(pBS-SK; Stratagene, USA) DNA were included in DNA-PK and 
ATM reactions, respectively, where indicated. After 30 
min. at 30 'C, reactions were terminated by transferring 
to an ice water bath. Following TCIA. precipitation. 
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labelled p53 was resolved by 10% SDS-PAGE and visualised 
by autoradiography. The gel section containing labelled 
. p53 was excised and the protein eluted and TCA 
•precipitated as described (Alessi et ai., 1996). The 
washed TCA pellet was either digested directly with 
alkylated trypsin (Promega, Southampton, UK) or, for ASP- 
N digestion, solubilised first in 0.2% v/v Triton X-lOO 
and digested overnight with 1:5 w:w Asp-N (Boehringer 
Mannheim) and, where indicated, followed by overnight 
digestion with trypsin. The supernatant containing 
digested protein was chromatographed on a Vydac 218TP54 
C18 column (Separations Group, Hesperia, _CA) equilibrated 
with 0.1% v/v triflouroacetic acid (TFA) , and eluted with 
a linear acetronitrile gradient. The flow rate was 0.8 
ml/min. and 0.4 ml fractions were collected. Peak 
fractions were coupled covalently to a Sequelon 
acrylamide membrane and analysed on an Applied Biosys terns 
470A sequencer using the modified programme described by 
(Stokoe et ai., 1992) to determine Edman degradation 
cycle numbers corresponding to radioactivity release. 

Additional Purification of ATW 

Purification using NTA has been described already above. 
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Claims ; 

1. An assay method for a compound able to modulate the 
interaction between (i) ATM or a protein having an 
associated kinase activity and (ii) p53 or any protein 
having ATM phosphorylation sites homologous to those of 
p53, the method including the steps of: 

(a) bringing into contact a substance including a 
peptide fragment of (i) or a derivative, variant or 
analogue thereof, a substance including the relevant 
fragment of p53 or any protein having phosphorylation 
sites homologous to p53, or a variant, derivative or 
analogue thereof, and a test compound; and 

(b) determining interaction or binding between said 
substances and test compound. 

2. An assay method for a compound able to modulate the 
interaction between (i) ATM or a protein having an 
associated kinase activity and (ii) p53 or any protein 
having ATM phosphorylation sites homologous to those of 
p53, the method including the steps of: 

(a) bringing into contact a substance which 
includes at least a fragment of (i) which phosphorylates 
p53 or any protein having phosphorylation sites 
homologous to those of p53, a substance which includes at 
least a fragment of p53 or any protein, which includes a 
site which is phosphorylated by (i) , and a test compound; 
and 

(b) determining phosphorylation at said site. 

3 . An assay method for a compound able to affect p53 
activity by modulation of interaction between (i) ATM or 
a protein having an associated kinase activity and (ii) 
p53, the method including the steps of: . 

(a) bringing into contact a substance which is p53 
or a fragment, derivative, variant or analogue thereof, 
and a test compound; and 

(b) determining p53 activity in the presence and 
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absence of (i) . 

. '4. The method according to any one of claims 1 to 3 
wherein the protein having an associated kinase activity 
5 is DNA-PK or ATR. 

5. An agent capable of modulating interaction between 
(i) ATM or a protein having an associated kinase activity 
and (ii) p53 or any protein having ATM phosphorylation 

10 sites homologous to p53 obtained using a method according 

to any of claims 1 to 4 . 

6. An agent according to claim 5 capable of modulating 
ATM-mediated phosphorylation on p53 or said homologous 

15 sites. 

7. A peptide fragment of p53 capable of modulating 
interaction between ATM or a protein having an associated 
kinase activity and p53 . 

20 

8. A peptide according to claim 7 capable of modulating 
phosphorylation of p53 by ATM or by a protein having an 
associated kinase activity. 

25 9. A peptide according to claim 7 or claim 8 which has 

a sequence f ovmd in human p53 including Thrl8 . 

10. A peptide according to claim 7 or claim 8 which has 
a sequence found in human p53 including SerlS. 

30 

11. A peptide according to any one of claims 7 to 10 
wherein the protein having an associated kinase activity 
is DNA-PK or ATR. 

35 12. A nucleic acid isolate encoding a peptide according 

to any of claims 7 to 11. 
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13 . A peptide fragment of ATM or a protein having an 
^associated kinase activity which is capable of modulating 
.interaction between ATM or the protein and p53. 

5 14. A peptide according to claim 13 capable of 

modulating phosphorylation of p53 by ATM or a protein 
having an associated kinase activity. 

15. The peptide of claim 13 or claim 14 wherein the 
10 protein having an associated kinase activity is DNA-PK or 

ATR. 

.16. A nucleic acid isolate encoding a peptide according 
to any one of claims 13 to 15. 

15 

17. An agent or peptide fragment or nucleic acid isolate 
according to any of claims 5 to 16 for use in a method of 
treatment by therapy involving modulating ATM action. 

20 18. Use of an agent or peptide fragment or nucleic acid 

isolate according to any of claims 5 to 17 in the 
manufacture of a medicament for modulating ATM action. 

19. An assay method for a compound able to affect DNA 
25 binding by ATM or a protein having an associated kinase 

activity, the method including the steps of: 

(a) bringing into contact a substance which is ATM 
or a protein having an associated kinase activity, or a 
fragment, variant or derivative which is able to bind 

30 DNA, DNA, and a test compound, under conditions wherein, 

in the absence of the test compound being an inhibitor of 
DNA binding by ATM or the protein having an associated 
kinase activity, said substance binds DNA; and 

(b) determining binding between said substance and 
35 said DNA. 

20. The assay method according to* claim 19 wherein the 
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protein having an associated kinase activity is dna-pk or 
ATR. 



21. An agent capable of affecting DNA binding by ATM 
obtained using a method according to claim 19 or claim 
20. 



22. An agent according to claim 21 for use in a method 
of treatment by therapy involving modulating ATM action. 

10 

23. Use of an agent according to claim 21 in the 
manufacture of a medicament for modulating ATM action. 

24. A method of purifying ATM or ATR including 

15 contacting a mixture of molecules including ATM or ATR 

with DNA, washing off molecules which do not bind the 
DNA, and recovering ATM or ATR from the DNA-bound 
fraction. 



20 



30 



25. Use of DNA for purifying ATM or ATR. 



26. A method of purifying ATM or ATR, including 
contacting a mixture of molecules including ATM or ATR 
with NTA, washing off molecules which do not bind the 

25 NTA, and recovering ATM or ATR from the NTA-bound 

fraction. 

27. A method according to claim 26 wherein the mixture 
is contacted with NTA in the presence of Ni^* . 



28. Substantially pure AIM. 

29. Substantially pure ATR. 



35 
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/gene="ATM* 

/ trans la t ion= - MSLVIJaDIJLI(XROLEHDRATEIWKEVEKF^^ 
HSDSKCXSKYli^VffiAVFIU^LQKYIQKETECUlI^ 
IKCANTmPRLKCXJELUOYIMrnVKDSSNGA 
Wl^FSVYFRLYLKPSQDVHRVLVT^IIHAVTKGCCSQTDGU^S 
EKSSSGUOHIIJ^TIFUCriAVNFRIRVCELGDEILPTLLYI^ 
LTOLQr!fIHHPKGAKlX}EKGAYESTK>n^ILYNLYDI^VNEISHIGSRGK^ 
VKET^IEUdADICHQVFNEDTOSLEISOSYTriXJRESSDYSV^ 
LQKSQNDFDLVPWLQIATOLISKYPASLPNCELSPUICLSQI^PQQ 
LTEVTO/TQDKRSNLESSQKSDUiKLWNKIWCITFTlGISSEQIOA^^ 
EVDREFWKIJTX3SACRPSCPAVCCl.TIJUjTrSIVP^ 
MKWLLFYQLBGDLENSTEVPPIUISNFPHLVLEKILVS^ 
HQKIKEELSFSE^7EELFU7ITFDKMDFLTI\mBCGIEKHQSSIGFS^^ 
USLSEQLU^NYSSEITNSETLWCSRUiVGVI^^ 
QCAGESITLFKNKTNEEFRIGSU^NmQIXTroC^^ 
NDIADICKSIASFIKKPFDRGEVESMEDIXnJCSNIiffi^ 
PGESQSTIGAINPLAEEyLSKQDLLFIJDMLKFl/^^ 
IDSSTLEPTKSU^LH^nfIJMLLKELPGEEyPLP^!EDVLELLKP 
lUmVLHWKNZjGQSNMDSENTRDAQGQFLTV^GAFWHLT^^ 
LLEADPYSKWAILNVMGKDFP\mEVFTQFlJa3^Q\^^ 
UCAIJ^UOiQQTAFENAYLKAQECMREMSHSAENPETLDEry^^ 
ICEKQALFAIJCKSVKE^K^JEPHLVKKVLEKVSETFGYRR^^ 
DTEYl^SSFPFILI^rmilEDFyRSCYKVLIPHLVI^ 
IXrFPKILVNILPYFAYBGTraSGMAQQRETATKVYiaiLKSEI^^ 
WELI2miHEPANSSAS0STOLCDFSGIUI)PAPNPPHFPSHVI^ 
SILEILSKSPDSYQKILLAICEOAAETONVYKKHRILKIYHLFS^IJil^ 
AFVaJU3Vn[YTLIHYINQRPSCraiDVSLRSFSLC^ 
VGTLIPLVYBQVEVQKQVU^IJUKyLVirmDNE^ 
KIKYSRGPFSLI^INHFLSVSVYDALPLTRIJEXSOKDIJUIQLEI^^ 
PODGIMVKL\A?NLLQLSKMAINHTGEKEVLEAVGSa^ 
KALKU^KELQOTTIMLTyi2Iim.VEDC^ 

TOPMI^YLQPFRTSRKKFLEVPRFDKn^FEGIJJDimmPI^eQ^ 
SGGTKCEILQIAJCPMCEVKTDFCQTS^PYLIHDILLQDOt^ 
RHFS(yrSRSrrPAmjDSESEHFFRCClJDKKSQRTOrAVAmYMra^ 
U3LNYLEVAKVAQSCAAHFTAI^YAEnrADKKSMDIX?EKRSI^^ 
SKEETGISLQDLLLEIYI^IGEPDSLYGCGGGKMLQPITRLRTyEHEA^^ 
ETAIPSSTOQAGII0AU?^^^U:HILSVY1^GLDYHOT3WCPELEEIJ^^ 
HCTSVSKEVBGTSYHESLYNALQSUUDREFSTFYESUCY^ 
YPTLSRLQAIGELESIGELFSRSVTHRQLSEVYIKWQKHSQLLK^ 
VILEIU^EKE^ImSQRECIKDILTKHLVELSIIJ^TFKmX}LPER^ 
VSEl^LEEAQVFWAKKEQSIJa.SIIJC<^KKLEASCA;^^ 
ETCXENPAVIMQTYLEKAVEVAGrraX3ESSDELRNGKMK^^ 
KSSEFEWKQALLKRAKEEVGLIiRmKIQTimYTVK\^ 
KAVENYINO^GEEHia^WVFRI/ISIM^ 
AARM3TKMMGGU3FHEVI2^NLISRISMDHPHHTLFII^ 
ITTO«VPKQSSQIiDEDRTEAANRIICTIRSRRPQM\niSVEALCn^^ 
RKGINIPAIX?PITKIJCNLED\AA^PTMEIKVD^^ - 
IipCVGSpGKERRQLVKGRDDIJlQDAVMQQVFCyiCl^^ KINASE 
LSQRSGVLBVCTGTVPIGEFLVNNKUjAHi^ DOMAIN 
VFMDVCONFOP\^YFCM2aTJ5PAIWFEKRIAYTR 
INBQSAELVHIDIiGVTVFBQGKILPTPEriVPFT^ 
VMRNSOETLliTrVEVIjLYDPLFPtfmNPLKAL 
SDiroSFPKVAERVIJlRLQEKLKGVEBCnVI^^ 



Sequence 9385 BP; 3030 A; 1685 C; 1973 G; 2697 T; 0 other; 
U33841 Length: 9385 July 10, 1998 12:06 TVpe: N Check: 7765 
1 GCGAGAGGAG TCGGGATCTG CGCTGCAGCC ACXXXTGCGG TTGATACTAC 
51 TrrGACCTTC CGAGTGCAGT GAGGCATACA TCACAATTTG GAATTATGCA 
101 rrOGTITATC AATTTACTTG ITTAITGTCA COCTGCTQCC CAGATATGAC 
151 TTCATGAOGA CAGTGATGTG TGITCTGAAA TTGTGAAOCA TGAGTCTAGT 



Figure 6b (1) 
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Fl^jre 6b (il) 

201 ACTTAATGAT CTGCTTATCT GCTGCOGTCA ACTAGAACAT GATAGAGCTA 
251 CAGAACXWVA GAAAGAAGTT GAGAAATTTA AGCXXXTCAT TCGAGATCCT 
301 GAAACAATTA AACATCTAGA TCGGCATTCA GATTCCAAAC AAGGAAAATA 
351 TITCAATrGG GATGCTGTIT TTAGATnTT ACAGAAATAT ATTCAGAAAG 
401 AAACAGAATG TCTGAGAATA GCAAAACCAA ATOTATCAGC CTCAACACAA 
451 GCCTCCAGGC AGAAAAAGAT GCAGGAAATC AGTACTITCG TCAAATACTT 
501 CATCAAATGT GCAAACAGAA GAGCACCTAG GCTAAAATOT CAAGAACTCT 
551 TAAATTATAT CAT3GATACA GTGAAAGATT CATCTAATCG TCCTATTTAC 
601 GGAGCTGATT GTAGCAACAT ACTACTCAAA GACATTCTTr CTCTCAGAAA 
651 ATACTGGTGT GAAATATCTC AGCAACAGTG GmOAATTO TICICIGTCT 
701 ACTTCAGGCT CTATCTGAAA CCTICACAAG AOCTTCATAG AGITmOTO 
751 GCTAGAATAA TTCATOCTGT TACCAAAGGA TCCTOITOC AGACTXSACGG 
801 ATTAAATTCC AAATTmTM ACrmriTC CAAGGCTATO 
851 GACAAGAAAA GAGCTCITCA GGICTAAATC ATATCITAQC 
90X ATCTTOCTCA AGACmOGC TGTCAACnT CGAATTOGAG TCOGTCAATT 
951 AGGAGATGAA ATTCTTCOCA CTTTQCnTA TATITCGACT CAACATAQGC 
1001 TTAATGATTC TTTAAAAGAA GTCATTATTO AATTATITCA ACTCCAAAOT 
1051 TATATOCATC ATCCGAAAGG AGCCAAAACC CAAGAAAAAG GTOCTTATCA 
1101 ATCAACAAAA TGGAGAAGTA TTITATACAA CTOATATOAT CTOCTAGIGA 
1151 ATGAGATAAG TCATATAGGA AGTAGAGGAA AGTATTCrrc AGGATITCGT 
1201 AATATO300G TCAAAGAAAA TITGATrGAA TIGATCGCAG ATATCTGICA 

1251 cxMonrrr aatgaagata ccaGATCCiT osAGAWicr caatcitaca 

1301 CTACTACACA AAGAGAAOCT AGTOATTACA GKSICOCITC CRAAAGGAAG 

1351 AAAATAGAAC TAGGCTOQGA AGHRATRAAA GAOCAOCnC AGAAGTCftCA 

1401 GAATCATTTT GATCTIGTQC CITOQCEACA GATTCCAACC CAAOTAATAT 

1451 CAAAGTAICC TCCAAGITEA OCTAACTCTC AGCTOICTOC AmOGATO 

1501 ATACTATCIC AGCTTCTACC CCa^CAQOSA CATO^^ 

1551 TCTGTTAOGA TGCCTTAOSG AAGlTQCATr GTOICAAGAC AAGAQGTCAA 

1601 ACCTAGAAAG CTCACAAAAG TCAGATTEAT -nVAAAClCro GAATAAAATT 

1651 TQGTOIATTA CCITrOCTGG TATAAGITCT GAGCAAATAC AAQCTOAAAA 

1701 CTTTOGCTTA CTTOGAGCCA TAATI^ 

1751 GAGAATTCTG GAAGTTATrr ACTGQC3ICAG OCTOCaOACC TTCATOTOCT 

1801 GCACTATQCT CTrrcACTIT GQCACTGRDC ACCAGIATRG ITCCAGGRQC 

1851 OC3TAAAAATG GGAATAGAQC AAAATATSTO TGAA£SHUVAT AGAAQCTTTT 
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F^i^^ 6 b (ill) 

1901 CriTAAAGGA ATCAATAATG AAATOC3CTCT TATTCTATCA GTTAGAGGCTr 

1951 GACITAGAAA ATAGCACAGA ACTGCXTTCrA ATTCTTCACA GTAATTITCC 

2001 TOVTCTICTA CTOGAGAAAA TTCTTCTOAG TCTCACTATC AAAAACTOTA 

2051 AAGCTGCAAT GAATTnTTC CAAAGCGTGC CAGAATGTCA ACACCACCAA 

2101 AAAGATAAAG AAGAACTTTC ATTCTCAGAA GTAGAAGAAC TATTICITCA 

2151 GACAACTTTT GACAAGATGG ACTTTTTAAC CATTCTCAGA GAATCTOC?rA 

2201 TAGAAAAGCA CCAGTCCAGT ATTGGCTICT CTGTCCACCA GAATCTCAAG 

2251 GAATCACTGG ATOGCTGTCT TCTGGGATTA TCAGAACAGC TTCTCAATAA 

2301 TTACTCATCT GAGATTACAA ATTCAGAAAC TCTTCTCCGG TCTTCACOrc 

2351 TrrTGGTGGG TGTCCTTOGC TGCTACTGTr ACATX3GGTCT AATAGCTCAA. 

2401 GAGGAAGCAT ATAAGTCAGA ATTATTCCAG AAAGCCAACT CTCTAATCCA 

2451 ATCTGCAGGA GAAAGTATCA CTCTGTrTAA AAATAAGACA AATCAGGAAT 

2501 TCAGAATIGG TTCCTTGAGA AATATGATGC AGCTATCTAC ACX5TTCCITC 

2551 AGCAACTGTA (XAAGAAGAG TCCAAATAAG ATTGCATXTTC GCTTnTCCT 

2601 GCGATTGTTA ACATCAAAGC TAATGAATCA CATTX3CAGAT ATITCTAAAA 

2651 CnTTAGCATC CTTCATCAAA AAGCCATTTG ACCGTCGAGA AGTAGAATCA 

2701 ATQGAAGATG ATACTAATGG AAATCTAATG GAGGTCGAGG ATCAGTCATC 

2751 CATGAATCTA TTTAACXyiTT ACCCTQATAG TAGTGTTAGT GATCCAAACG 

2801 AACCTQGAGA GAGCCAAAGT ACCATAGGTG CCATTPJilCC TTTAGCTCAA 

2851 GAATATCTGT CAAAGCAAGA TCTACTTrTC TTAGACATGC TXIIAAGlTCTr 

2901 GTCSriTGTCT GTAACTACTG CTCAGAOCAA TACICTGTCC TTEAGGGCAG 

2951 CTOATATTOG GAGGAAATTG TTAATGITAA TTGATTCTAG CACGCTAGAA 

3001 CCTAOCAAAT 0CCICCAC3CT GCATATGTAT CTAATOCTTT TAAAGGAGCT 

3051 TOCTOGAGAA GAGIMCCCT TCOCAATOGA AGATOTICTT GAACTTCTOA 

3101 AAOCACTATC CAATCTGTOT TCTITOEATC GTCGTCACCA AGATGTTICT 

3151 AAAACTMTT TAAACCAOCT OCTTCATGrA GTCAAAAACC TAQGTCAAAG 

3201 CAATAT0C3AC TCTGAGAACA CAAQGGATCC TCAAGGACAG ITICTEACAG 

3251 TAATTQGAGC ATTITOGCAT CTAACAAAGG AGAGGAAATA TATA1TCTCT 

3301 CTAAGAATOG CTCTACTAAA TPQCXOTAAA ACTTTOCITC AGQCTOATCC 

3351 TIATICAAAA TQOQCCATTC TTAATCTAAT GGGAAAAGAC TTICCTCTAA 

3401 ATGAAGTATT TACACAATTT CTTGCIGACA ATCATCACCA ACHTCGCATC 

3451 TTOCSCroCAG ACTCAATCAA TAGATTCITC CAGGACAOSA AOQGAGATIC 

3501 TTOGAOfflTA CTCAAAQCAC TlOCnTGAA GCTTCAGCAA ACAGCITITC 
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., 3551 AAAATOCATA CITCAAAGCT CA0C3AAGGAA TCAGAC5AAAT CTTCCCATAGT 
. 3601 GCTCAGAACC CTOAAACITr GGATGAAATT TATAATAGAA AATCTCTITr 
■ 3651 ACTCAOGTTG ATAGCTCTCG TTTTATCCTG TAGCCCTATC TCCGAAAAAC 
3701 AGGCTITCTT TGCCCTCTGT AAATCTGTGA AAGAGAATCG ATTAGAACCT 
3751 CAOCTTCTGA AAAAGGTTrT AGAC3AAAGTT •ICTCAAACTr TTOGATATAG 
3801 ACXJTTTAGAA GACTTTATGG CATCTCATIT AGATTA-ICTC CTTITCGAAT 
3851 GGCTAAATCT TCAAGATACT GAATACAACT TATCTICTTT TCCnTTATT 
3901 TTATTAAACT ACACAAATAT TCAGGATITC TATAGATCTT CHTATAAGGT 
3951 TTTGATTOCA CATCTGGTC3A TTAGAAGTCA TTITCATCAG CHGAAGTCCA 
4001 TTCCTAATCA GATTCAAGAG GACTGGAAAA GTCTTCTAAC AGACTOCriT 
4051 CCAAAGATTC TTGTAAATAT TCTTCCTTAT TITCCCTATC AGGGTACX»G 
4101 AGACAGTGGG ATGGCACAGC AAAGAGAGAC TCCTACCAAG GICTATCATA 
4151 TGCTTAAAAG TGAAAACTTA TTOGGAAAAC AGATTOATCA CnAlTCATr 
4201 AGTAATTTAC CAGAGATTGT GGTCGAGTTA TTOA1X3ACGT TACATCftGCC 
4251 AGCAAATTCT AGTCCCAGTC AGAGCACTGA CCICTOTCAC TnTCAGQGG 
4301 ATrrGGATCC TGCTCCTAAT CCACCTCATT TTCCA'TCGCA TCTCATEAAA 
4351 GCAACATTTG OCTATATCAG CAAITCTCAT AAAACCAACT TAAAAAGCAT 
4401 TTTAGAAATT CTTICCAAAA GCCCTQATIC CTA7CAGAAA AnCITCITC 
4451 CCATATGTGA GCAAGCAGCT GAAACAAATA ATCTTTATAA GAAGCACAGA 
4501 ATTCTTAAAA TATATCACCT GTrrGTTAGT ITATTACKSA AAGATATAAA 
4551 AACTOGCTTA GGAGGAGCTT GGGOCTriGT TCITCGAGAC GrrATTTATA 
4601 CTTTGATrCA CTATATCAAC CAAAGGCCTT CITCTATCAT QGAIGroiCA 
4651 -mCGTAGCr TCTCCXTTTG ITCIGACTnv TTAAGICAGG TITCCCAGAC 
4701 AGCOGTGACT TACTGTAAGG ATQCTCTAGA AAACCATCTr CATCirATTC 
4751 TIGOTACACT TATACCCCTT GTCTATCAGC AGGTCGAGGT TCAGAAACAG 
4801 GJAOTOSACT TCTTGAAATA CITAGTGATA G^^ 

4851 CnCTATATC AOSATTAAGC TITrAGATCC TriTCCroAC CA-ICTTOITr 
4901 THJUKATIT GCOTATTACT OUXAAAAAA TCAAATACAG 
4951 •nriCACICT TQGAGGAAAT TAAOCATITT CTXTO^ 
5001 TQCACITOCA TTGACAAGAC TTGAAQGACT AAAQGATCTT CGAAGACAAC 
5051 TOGAACTACA TAAAGATCAG ATQGTQGACA TTATCAGAGC TICTCAGGAT 
5101 AATCC3QCAAG ATOaSATTAT GOTGAAACTA OITOTCAATT TCTTOCAOIT 
5151 ATOCAAGATG GCAATAAACC ACACTSC?rSA AAAAGAAGIT CTAGAOQCTC 
5201 TroSAAGCro CTTOOGAGAA OTOOCTOCTA ■mGATTroiC TaoCATAGCT 
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5251 ATACAACATA GTAAAC3ATGC ATCTTATACC AAGGCXXTTTA ACTTATITCA 
5301 AGATAAAGAA CTTCAGTGGA CCTTCATAAT GCTCACCTAC CIXSAATAACA 
5351 CACTCC3TAGA AGATTGTGTC AAAGTTCGAT CAGCAGCTOT TACCTCirTO 
5401 AAAAACATTT TAGCCACAAA GACTCGACAT ACTTICTOCSG AGATmTAA 
5451 GATGACAACA GATCCAATGC TGGCCTATCT ACAGCCTriT AGAACATCAA 
5501 GAAAAAAOTT TITAGAAGTA CCCAGATTTC ACAAAGAAAA CCCTnTCAA 
5551 GGCCTGGATG ATATAAATCT GTGGATTCCT CTAAGIGAAA ATCATCACAT 
5601 TTOGATAAAG ACACTGACTT GTGCTmTr GGACAGTOGA QGCACAAAAT 
5651 GTGAAATICT TCAATTATTA AAGCCAATCT GTCAAGR3AA AAClGACm 
5701 TGTCAGACTG TACTTCCATA CrTGATTCAT GATATTTrAC TCCAAGATAC ' 
5751 AAATGAATCA TGGAGAAATC TGCnTCTAC ACATCTICAG QGATmTCA 
5801 CCAGCTGTCT ICGACACTTC TCGCAAACGA GCOGATCCAC AACCCCIQCA 
5851 AACTTOGATT CAGAGTCAGA GCACTmrc OSATGCIOT 
5901 ATCACAAAGA ACAATGCTTG CTGITCrOGA CEACATCAGA AGACAAAAGA 
5951 GACCTTCTTC AGGAACAATT TTTAATGATG CnTCTOGCT GGATTTAAAT 
6001 TATCTAGAAG TTGCCAAGGT AGCTCAGTCT TCTCCTCCIC ACTITACAGC 
6051 TTTACTCTAT GCAGAAATCT ATGCAGATAA GAAAAGIATC GATCATCAAG 
6101 AGAAAAGAAG TCTTGCATTT GAAGAAGGAA GCCAGAGTAC AACTATOTCT 
6151 AGCrTGAGTG AAAAAAGTAA AGAAGAAACT QGAATAAGIT TACAQGATCT 
6201 TCTCTTAGAA ATCTACAGAA CTATAQGQGA QCXaGATRCT TTOTATCQCT 
6251 GT0C3TQGAGG GAAGATGTTA CAACCCATOA CTAGACiaCG AACAOATGAA 
6301 CAOGAAGCAA TGTCGGGCAA AGC3CCI!RGTA ACATMGACX: TOGAAACftSC 
6351 AATCCCCTCA TCAACACX3CC AGOCAOGAAT CAlTCAQacC TOSCaGAATT 
6401 T3QGACTCTG CCATATICTT TCC3GTCIATO TAAAAOaaT GQATERSQAA 
6451 AATAAAGACT QCTGTCCTGA ACTAGAAQAA OTTCATIftCC AAGCaOC»TO 
6501 GAGGAATATG CAGTCGGACC ATTGCACTTC OGTCAGCAAA GAAOERGAAG 
6551 GAACCAGTTA OCATGAATCA TTOTACAAaXS CICTACMOC TCTAAGRGRC 
6601 AGAGAATTCT CTACATTITA TGAAAGTCTC AAATATOCTA GAGTAAAAGA 
6651 AGTGGAAGAG AroiCTAAGC GCAGCCTTCA OKTOIOTAT TOGCICTMC 
6701 CXaOiCrmG CAGCmxxaiG GCXATIX»AG AGCIXX^ 
6751 CmrcrCAA GATCAGTCAC ACATAGACAA CTCICTGAAG TA7A1MTAA 
6801 GTX3QCAGAAA CACTCOCAGC TTCTCAAOGA CAGTGATPIT ABmrCMXS 
6851 AGCXn-ATCAT GQCTCTAaSC ACAfJICArPT TOtSAGATOCT GATGGAAAAS 
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6901 


GAAATOGACA ACTCACAAAG 


AGAATGTATT 


AAGGACATTC TCACCAAACA 


6951 


CCTTGTAGAA 


CTCTCTATAC 


TGGCCAGAAC 


irrCAAGAAC ACTCAGCTCC 


7001 


CTGAAAGGGC 


AATATTTCAA 


ATTAAACAGT 


ACAATTCAGT TAGCTGTGGA 


7051 


GTCTCTGAGT 


GGCAGCTGGA 


AGAAGCACAA 


GTATTCTGGG CAAAAAAGGA 


7101 


GCAGAGTCTT GCCCTCAGTA 


TTCTCAAGCA 


AATGATCAAG AAGTTQGATG 


7151 
7201 


CCAGCTCTCC 


AGOGAACAAT 
GTGGCAACTG 


CCCAGCCTAA 
GTTAGCAGAA 


AACITACATA CACAGAATGT 


CTGAGGGTIT 


ACGTXXriTAG AAAATOCTCC 


7251 


GGTCATCATG 


CAGACCTATC 


TAGAAAAGGC 


AGTAGAAGTT GCTGGAAATT 


7301 


ATGATOGAGA AAGTAGTGAT 


GAGCTAAGAA 


ATGGAAAAAT GAAGGCATTT 


7351 


CTCTCATTAG 


ccoGGnrrc 


AGATACTCAA 


TACCAAAGAA TIGAAAACTA 


7401 


CATGAAATCA 


TCGGAATTTG 


AAAACAAGCA 


AGCTCTCCTG AAAAGAGOCA 


7451 


AAGAGGAAGT 


AGGTCTCCTT 


AGGGAACATA 


AAATTCAGAC AAACAGATAC 


7501 


ACAGTAAAOG 


TTCAGCGAGA 


GCTGGAGTTG 


GATGAATTAG CCCTGOGTGC 


7551 


ACTSAAAGAG 


GATCGTAAAC 


GCTTCTTATG 


TAAAGCAGTr GAAAATTATA 


7601 
7651 


TCAACTQCTT 


A1TAAGTGGA 


GAAGAACATG 


ATATGIGQGT ATTOCGQCTT 


TCITCCCTCT 


GGCTTGAAAA 


TTCTQGAGTT 


TCTGAAGTCA ATQGCATCAT 


7701 


GAAGAGAGAC 


GGAATGAAGA 


TTCCAACATA 


TAAA'i-rmu CCTCTTATCT 


7751 


ACCAATTQGC 


TGCTAGAATG 


GGGACX!AAGA 


TCATGQGAGG CCTAGGATTT 


7801 


CATCAAGTCC 


TCAATAATCT 


AATCTCTAGA 


ATTICAATGG ATCACCXXXa^ 


7851 


TCACACrTTG 


TTTATTATAC 


TGGCCTTAGC 


AAATGCAAAC AGAGAT8AAT 


7901 


TTCTGACTAA 


ACCAGAGGTA 


GCCAGAAGAA 


GCA8AATAAC TAAAAATGTG 


7951 


OCTAAACAAA 


GCTCTCAQCT 


TGATCAGGAT 


OGAACAGAGG CTGCAAATAG 


8001 


AATAATATGT ACTATCAGAA 


GTAGGAGACC 


TCAGATGCTIC AGAAGTOIT6 


8051 


AOGCACTTTG 


TGATGCTEAT 


ATTATATTAG 


CAAACTTAGA IGOCACTCAG 


8101 


TQGAAGACTC 


AGAGAAAAGG 


CATAAATATT 


OCAGCAGACC AGCX^^TTAC 


8151 


TAAACTTAAG AATITAGAAG 


ATOi'lVriOT 


OCXTTACTATG GAAATTAAOG 


8201 


TQCaAGCACAC 


AGGAGAATAT 


OGAAAOCIQG 


TGACTATACA GTCATTTAAA 


8251 


OCAGAATTTC 


GCTTAGCAGG 


AGGTOTAAAT 


TTACXS^AAAA TAATAGATIG 


8301 


TGTAQCJriCC 


GATGGCAAGG 


AGAOGAGACA 


GCTTGTTAAG GGOCGTGATG 


8351 


AOCTCAGACA AGATGCTCIC 


ATQCAACAGG 


TCTTCCAGAT GTCTAATACA 


8401 
8451 


ITACTCCAGA 
TTATAAGGTG 


GAAACAOGGA 


AACTAGGAAG 
CICAGOGAAG 


AGGAAATTAA CTATCICTAC 


G'lnxxxrrcT 


TGGTGl'lCrr GAAlXJCiroCA 


8501 


CAGGAACICT QOCCATTOGT 


GAATTICITC 


TTAACAATGA AGATOGIGCT 


8551 


CATftAAAGAT ACA0GCX:AAA 


TGATPICAGT 


GOCmCA&r GQCAAAAGAA 
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Figure 6b (vll) 



8601 


AATGATCGAG 


GTGCAAAAAA 


ACirriTlUA 


AGAGAAATAT 


GAAGTCTTCA 


8651 


TGGATGTTTG 


CCAAAATTTT 


CAACCAGTTT 


TCCGTTACTT 


CTGCATGGAA 


8701 


AAATTCTTQG 


ATCCAGCTAT 


TrGGTTTGAG 


AAGCGATTQG 


CTTATACGCX3 


8751 


CAGTC3TAGCT 


ACTTCTTCTA 


1TGTTGGTTA 


CATACTTGGA 


CrrGGlXATA 


8801 


GACATGTACA 


GAATATCTTG 


ATAAATGAGC 


AGTCAGCAGft 


ACTTGTACAT 


8851 


ATAGATCTAG 


GTGTTGCTTT 


TGAACAGGGC 


AAAATCCTTC 


CTACTCCTGA 


8901 


GACAGTTOCT TTTAGACTCA 


CCAGAGATAT 


TGTGGATOGC 


ATGGGCATTA 


8951 


cxixxriXiri'iGA 


AGGTCTCTIC 


AGAAGATGCT 


GTGAGAAAAC 


CATOGAAGTG 


9001 


ATGAGAAACT 


CTCAGGAAAC 


TCTGTTAACC 


ATTXTTAGAOG 


TCCTTCTATA 


9051 


TCATCCACTC 


TTTGACTGGA 


CCATGAATCC 


TITGAAAGCT 


TTGTATTTAC 


9101 


AGCAGAGGCX: 


GGAAGATGAA 


ACTGAGCTTC 


ACCXTTACTCT GAATGCAGAT 


9151 


GAOCAAGAAT 


GCAAACGAAA 


TCTCAGTGAT 


ATIGACCAGA 


GTTTOGACAA 


9201 


AGTAGCTQAA 


OGTGTCITAA 


TGAGACTACA 


AGAGAAACTG 


AAAGGAGTGG 


9251 


AAGAAGGCAC 


TGTGCTCAGT 


GnXaGTQGAC 


AGGTGAATIT GCTCAIT^CAG 


9301 


CAGGCCATAG 


ACCCCAAAAA 


TCTCAGOOGA 


cnrroocAG 


GATGGAAAGC 


9351 


TTGGGTGIGA 


TCTTCAGTAT 


ATCAATTACC 


CTTTC 
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ure 7a 


XX 




FH 


Key 


FH 




FT 


source 


FT 




FT 


CDS 


FT 




FT 




FT 




FT 




FT 




FT 




FT 




FT 




FT 




FT 




FT 




FT 




FT 




XX 




SQ 


Sequence 



Location/Qualifiers 
1. .1303 

/organisms "Homo sapiens" 
122. .1303 
/codoi\_start=l 
/db_xref="PID:g339816" 
/db^xref = -SWISS-PROT: P04637 " 
/note=-p53 antigen? NCBl gi: 339816- 
/gene=-TP53" 
/map=-17pl3.1" 

33 BP; 292 A; 403 C; 348 G; 260 T; 0 other; 
M14695 Length: 1303 July 10, 1998 12:29 Type: N Checdc: 4902 
1 GT0CAGGM3C AGCTAGCTOC 'TOQGCTOCQG GGACACTITC GOITCQGOCT 
51 GC3GAG0GTCC TTTCCACXSLC QGlVSACftCQC rtOCCroGAT TOCSCAQCXaG 

101 i^yrGccTicc GGCfra^ Figure 7b (i ) 

151 CGAG0CXX3CT CTCAOTCAQG AAACATTITC AGAOCTATCG AAACTACITC 
CTGAAAACRA OOTCTGTCC COCTTQCOGT C3CCRAQCAAT GCSATGATITC 
ATOroroCC aSCSAOSATAT TGAACAATOG TrCACTOAAG ACCTAGGICC 



201 
251 
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Figure 7b (11) 

301 AGATGAAGCT CCCAGAATGC CAGAGOCTGC TCCCCXXXnXS GCCCCTOCAC 

351 CAGCGACTCC TACACCGGCG GCCCCTGCAC CAGCCXXXTC CTOGCCCCTC 

401 TCATCTTCTG TCCCTTCCCA GAAAACCTAC CAQGGCAGCT AOGGTITCCG 

451 TCTCCJGCrrc TTGCATTCTG GGACAGCCAA GTCTCTCACT TCCACCTACT 

501 CCCCTCCCXT CAACAAGATG TTITCOCAAC TCQCCAAGAC CTOCCCTOro 

551 CAGCTCTOGG TTGATTCCAC ACOCCOCSCCC QGCACCCGOG TCOGCGCCAT 

601 GGCCATCTAC AAGCAGICAC AGCACATGAC GGAGGITOTO AGGCGCTOCC 

651 CCCACCATGA GCOCTGCTCA GATAGOGATG GTCTOQCCCC TCCICAGCAT 

701 CTTATCaaVG TGGAAGGAAA TTroCOTGTG GACTATTTOG ATCACAGAAA 

751 CACTTTTCGA CATAGTGTGG TGGTGCCCTA TCAGCOGCCT GAGGTTOGCT 

801 CTGACTGTAC CAOCATCCAC TACAACTACA TCTCTAACAG TTCCTOCATC 

851 GGOQGCATCA ACOGGAGQCC CATCCTCACC AlCAOCACAC TGGaAGACTC 

901 CAGTGCTAAT CTACTOGGAC QGAACAGCIT TCAGSIGCGT CTITCTCCCT 

951 GTOCTQGGAG AGACCQGCX3C ACAGAGGAAG AGAATCTCOS CAAGAAAQGG 

1001 GAGCCTCACC AOGAGCTGCC CCCAGQGAGC ACTAAQCXSAG CACTCCCCAA 

1051 CAACACTAGC TCXTCTCCCC AGCCAAAGAA GAAACCACIG GATOGAGAAT 

1101 ATTTCAOXT TCAGATCCGT QGQCX3TGAGC GCTTOGRGAT GITOCGaGAG 

llSl CTGAATGAGG CCTTSGAACT CAAQGAaaJC CAGQCTGGQA AGGAGCXaOG 

1201 GQOGAOCAQG GCTCACICCA GCXaCCTOAA GTCCAAAAAG GSTCAOICIA 

1251 OCTCCOQOCA TAAAAAACTC ATOITCAAGA CAG»AQGGOC TCACICAGAC 

1301 IGA 
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Figure 8a (i) 
! 1.0 

lb = HS498441 standard; RNA; HUM; 8210 BP. 

AC ' U49844 ; 
XX . 

m gl235901 
XX 

DT 20-MAY-1996 (Rel. 47, Created) 

DT 20-MAY-1996 (Rel. 47, Last updated. Version 1) 

XX 

DE Human FRAP- related protein (FRPl) mRNA, complete cds 

XX 

KW 

XX 

OS Homo sapiens (human) 

CX: Eukaryota; Metazoa; Chordata; Vertebra ta; Mammalia; Eutheria; Primates; 

OC Catarrhini; Hominidae; Homo. 

XX 

RN (IJ 
RP 1-8210 

RA Cimprich K.A., Shin T.B., Keith C.T. , Schreiber S.L.; 

RT "cDNA cloning and gene mapping of a candidate human cell cycle 

RT chec3cpoint protein"; 

RL Proc. Natl. Acad. Sci. U.S.A. 93 :28S0-2855 (1996) . 
XX 

RN [2] 
RP 1-8210 

RA Cinprich K.A., Shin T.B., Keith C.T., Schreiber S.L.; 
RT 

RL Sutmitted (22-FEB-1996) to the OffiL/GenBank/DDBJ databases. 

RL Karlene A. Cinprich, Chemistry, Harvard University, 12 Oxford Street, 

RL Cambridge, MA 02138, USA 

XX 

DR SPTR£MBL; Q13535; Q13535. 
XX 

FH Key Location /Qualifiers 

FH 

FT source 1. .8210 

FT /organisms "Homo sapiens" 

FT /chromosomes " 3 • 

FT /cell_type=-Jurkat T-cell" 

FT /map=-3q22-q24- 

FT CDS 106. .8040 

FT / codon^s tar t = 1 

FT /db.j>a:ef = ■ PID : gl23 5902 • 

FT /db_pcref=-SPTREMBL:Q13535- 

FT /notes -similar to FRAP, Meclp, Torlp, aVDr2p, and ATM- 

FT /gene=-FRPl- /TTR 

FT /products "FRAP-related protein* 

FT /translations •MGEHGLELASMIPALRELGSATPEEYNTWQKPRQILCQFIDRIL 

FT TDVNVVAVELVIOCroSQPTSVMLLDFIOHIMI^ 

TRLLRIAATPSCHLLHKKICEVICSLLFUKSKSPAIFX3VLTKELLQLF 
V^OIAVEWPVVMSRFI^OTJ5E{lMrTVT ri<; JV W ATCMnwr .'mrm m.rijnrr moTTfi -rt rm 



FT 
FT 
FT 



VMaiAVE>yPVVMSRFl,SQLDEtoKMiOSAPI/3t^^ 
^ ^^^^^^^^^ IG^^^^ ^^LEYGSPKIKSLAISFXTE^ 

FT FMTTOOLKLYEEPLSKLIKTLFTFEAEAYRNIEPVYIIJMLLEKLCVMFT^ 
^ LI-KAALCHLUjyFl-KFVPAGYESALQVR^ 
FT LKMESMEIIEETOCrmv^T^m "CCMcn-r^TCtnrDDDT eee^ 



FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 



LKMESMEIIEEIQCQTQQEin^SNSIXSISPKRRRLSSSIOTSKRAPKQTEEIKH^ 
KSIUVSAIJCQKAESLQISLEYSGLKNPVIEMLK3IAVVLQLTALCTVH(::SH^^ 
KKXJHKSKKKPSVVITVMSLDFYIKVLKSCRSLIuESVQKLDLEAT 
QVNSSFTOHILEDLOaiLSLPWIYSHSDIXXXKLTTFAANLLTLSCRISDSY 
CVFl-LTLFPRRIFl.E>n^TAVYNWALQSSHEVIFiASCVSGFTILUXX3NSCNRVPK^ 
KVKIX>SDIVKKEFASIU3QLVCTLHGMFyLTSSLTEPFSEHGHVDLFX3U^^ 
SSSQLKASVCKPFl,FLIJCKKIPSFVKLAFmJLHHLCKHUDFT^ET^^ 
MgPDKDVRV7^SGNIKHILESLDSEIXSFIKELFVI.RM^^ 

TKa>IGRAAKGDLVPFALUILLHaXSKSASVSGAAyrEIRALVAAKS\^^ 

KPIOQn.VESLHSSQIfrALPOTTOQI0U>^ 

FliTimXMiPDLAAKASPAASALIRTUaCQLNWna^^ 
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Figure 8a (11) 

FT LER^UmjCl^EIELGSLLRQDFOGLHNEU.LRIGEHYOOV^ 

FT QGPIU)IISPEIJ4ADYWPKl^IlJU^FNMQU^SSVGim 

FT VSSVRVKMMTTLRTGLRFKDDFPEIXCRAVCCF\mCU)HAC^ 

FT QPKETAAIFHYL 1 1 ETOTDAVQDFUIEI YFLPDHPELKKIKAVLQEYRKETSESTDLQTT 

FT LQLSMKAIQHEIWDVRIHALTSUCETLYKNQEKLIICfATDSETVEPIISQL\^^ 

FT QnANSOARLUXSECI/SELGAIDPGRLDFSTTETQGKDFTFVm^V^ 

FT AYIAYAENSRAQDSAAYAIQELLSIYTCREMETOGPGHQIA^mRFPEH^ 

FT YKSSQKSTCWSGVKKPIYLSKliGSNFAEWSASWAGYLITKVmHDlJ^KIFT^ 

FT DFKVTIYLLPHILVYVLLGCNQEEXX?EN^AEIMAVIJaiDTO 

FT TVFSmJDHLTQWARHKFQAIJCAEKCPHSKSNRNKVDSMVSTW PQ 

F^^ imAVT^FRSKAYTRAVMHFESFITEKKQNIQEHLGFXQKLYAAMHEPDGVAG^^ 

FT AEPSUCBQILEMESLGLUUDATACYDRAIQLEPrX?! IHYHGWKSMLGI^ 

FT NG^W^NRSE^m)ELNTYRVEAAVOT^SQWDLVENYIA^^ 

FT ITAryDSUa.VRAEQIVPLSAASFERGSYQRGYEYI\mLHMLCELmSIKPLFQHSPGD 

FT SSQEDSUm/PJ^Etmi^USYW^J^llJ^^ 

FT AGHHQTAYNALIiOAGESRlAELYVERAKWl^SKGDVmQALIVLQKGV^^ 

FT GKN^ILIHGRA^^JLVGRFMEETANFESNAI^^aCYKI)^ 

FT MVmX^KMEKQGDLIRYIVUIPGRSLQYGNQFIYQSMPRjMLTI^^ 

FT DRVQMRNDIiGKINK\rtTEHltmAPYQFT.TAFSQLISRICHSHDEVFWlJffi lAKVFL 

FT AYPQQAMWMm'AVSKSSYPMRVNRa^II^nCAIHMKKSLEK^^ 

FT PVreSSSTLSMSTHFKMUaaVEEATFSEILIPLQSVMIPl^ 

FT GHWAYIAGFDIMVEIU^LQKPKKISIJCGSIXSKFYIMMCKPra 

FT KCLRKDAESRRRELHIRTYAVI PLNDECGI lEWVNNTAGLRPILTKLYKEKGVYMTGKE 

FT LRQCMLPKSAALSEKLKVFIlEFTiLPRHPPIFHEWFLRTFPDPTSWYSSRSAYCR 

FT SMVGYII^LGDRHGENILFT)SLTGEC\mVDFNCLFmGETFEVPEIW 

FT \ GPMGTEGLFRRACEVmiRLMRIXJREPLMSVUC^ 

FT VVNEKAKTHVLDIEQRLCXSVIKTRNRVTGLPLSIBGHV^ 

FT PYM" 

XX 

SQ Sequence 8210 BP; 2511 A; 1555 C; 1738 G; 2406 T; 0 other; 

U49844 Length: 8210 July 10, 1998 12:08 Type: N Check: 4511 

1 GCCTOCACAC GGCTCOGTCG GGOGCCGCGC TCTTCOGQCA GCGGTAQCTT 

51 TGGAGACGCC GGGAACXXXX: GTTGGCGTQG TrGACTAGTG CXTTOGCAGCC 

101 TCAGCATGGG GGAACATGGC CTGGAGCTGG CTTOCATGAT CXXZCGCCTTG 

151 CGGGAGCTGG GCAGTGCCAC ACCAGAGGAA TATAATACAG TTGTACAGAA 

201 GCCAAGACAA ATTCTGTGTC AATTCATIGA OCGGATACIT ACAGATSTAA 

251 ATGITCITCC TCSTAGAACIT GTAAAGAAAA CIGACTCICA GCCAACCTCC 

301 GTSATGTIGC TTGATITCAT OCAGCATATC AT8AAAT0CT CSOOCACITAT 

351 GITIGTAAAT GTQAGIGGAA GCCATGAGGC CAAAGGCAGT IGTATTGAAT 

401 TCAOTAATTG GATCATAACX5 AGACTTCTQC GGATTOCAQC AACTOCCTCC 

451 TGTCATITGT TACACAAGAA AATCTSTQAA GTCATCTGTT CATTATTATT 

501 TCimTAAA AGCAAGAGIC. CTOCTTlTTPr TQQGGTACIC ACAAAAGAAT 

551 TATTACAACr TITIGAAGAC TTQGTTrAOC TCCATAGAAG AAATGTGATG 

601 OC31CA1GCIG TQGAATSGOC AGTGGTCATG AGCXXSATTTT TSUtfnCAATT 

651 AGATCAACAC ATQGGATATT TACAATCAGC TCCITTGCAG TTGATGAGTA 

701 TGCAAAATTT AGAATTTATT GAAGTCACiT TATTAATCaGT TCTTACIOGT 

751 ATTATIGCAA TTGICTTTTT TAGAAOQCAA GAACICPEAC TTIGGCAGAT 

801 AGCmxnCTT'Croi^^ AAAAATTAAA- TSOCTA G GAA 
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Figure 8b (11) 

851 •n'AGCrrm MCAGAACTT TTTCAGCTro 

901 GCTAcx»ciT TmcAcxrrc ATirno^ 

951 MTGGATACT acCMTTCA MCICTATOV AGAGCCAm 
1001 TAAAGACACT ATmXCTTT GAAGC^GAAG OTATAGAAA TAT^^ 
1051 GICTATITAA ATATGCTCOT GOU^AAACIC TXn^ 
1101 TCTSCTCATC CC3GCTTAAGT CTCaTI^ 
1151 TACroCACTA TITCXriTAAA TTK^^ 

1201 CAAGTCAGGA AGGTCTATCT GAGAAATATT TCTAAAGCTC nTTOGATOr 

1251 QCnXXSiATT C3A<OTAGATC CACaCTAOT (mOS^^ 

1301 CTITCAAAAT GGAAACTATG GAAATCATTC AGGAGATTCA AT^^ 

1351 CAACACX5AAA ACX:iO«Xaw; TAATACnCAT GGAATA^^ 
• 1401 -KXnCICAGC TCCm^CTAA ACCCl^ 

1451 '^GGAAATTAA ACATGTGC3VC ATCAACCAAA AG^ 

1501 CTCAAACAGA AAQCTOAATC CCTTCAGATT KXXTtGlAT ACAOTOGCCT 

1551 AAAGAATCCT GTTATTGAGA TOmOAAGG AATTOCICTT CTCTTACAAC 

1601 TGACTGCTCT GTOTACTOTr CATICTICTC ATCAAAACAT GAACTOCOGT 

1651 ACTITCAAGG ACTCnCAACA TAAATOOU^G AAGft^^ 

1701 AACOTQGATC TCATTOGATT TTTACACAAA AGl«nTAAG AGC^^ 

1751 OmxnTAGA ATCTCTTCJkG AAACl^^ 

1801 GTQOTGAAAA TTTATCATOC TITCATmT ATCCAAGTAA ACAGITCAOT 

1851 TCAAGATCAT ATCrrOGAAG ATmTtMGG TATOCl^ 

1901 TmiTCOOV TICTOATGAT GGCTOTITW^ 

1951 AATCOTCTAA CAOTAAGCTX; OAQGATTrav GATJU^^ 

2001 ACAATOOGA TGTGTGmC TTCTOAC^ 

2051 -ITCAGTOGAG AACJ«X3«nT TACAACTOQG 

2101 GI»^TOX»G CXaGITGICT TAtaGGATTT 

2151 GAAWV.-.W1- AACAGAGITC CCaAGATTCT TATMSATAAA GICAAAGATC 

2201 ATOTCACAT TGTOU^ GAATITSCIT ™ 

2251 TCXACWnrc AOGQOVTOIT TIATCTGACA AGl^^ 

2301 CICTGAACAC QGACATCTOG AOCTCTTCIG TAQGAACITC AAAGCCACIT 

2351 CICAACATGA ATCnrCATCT TCTO^ACTAA AAQC^^ 

2401 ^nriTO TACTGAAAAA AAAAATAOCT AGTCCftG^ 

2451 CAOauaTOAT OTtf^TCATC TITOlftAOa 

2501 AAAOWaTCT AAAAGCAOT CITOSAACIT TATTAAA 
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Figure 8b (111) 

2551 CCAGACAAAG ATCTTAGAGT QGCTTITAGT GGAAATATCA AGCACATATT 

2601 GGAATCCnx; GACTCTGAAG ATQGATTTAT AAAGGAGCTT TITCICmA 

2651 GAATGAAQGA AGCATATACA CATCXXCAAA TATCAAGAAA TAATCAGCTC 

: 2701 AAGGATACCT TCATTCTTAC AACAGGC3GAT ATTGGAAGGG CXX3CAAAAGG 

2751 AGATTTCGTA CCATTTCCAC TCTTACACTT ATTGCATTCT TTCTTATCCA 

2801 AGTCAGCATC TCTCTCTGC3A GCAGCATACA CAGAAATTAG AGCTCTCGTT 

2851 GCAGCTAAAA GTGTTAAACT GCAAAGTTTT TICAGCCAGT ATAAGAAACC 

2901 »TCTGTCAG TITriX3GTAG AATCCXTITCA CICTAGTCAG ATCACAGCAC 

2951 TTOCGAATAC TCCATQCCAG AATGCTGACG TOCGAAAACA AGATCTCGCT 

3001 CACCAGAGAG AAATGOCTTT AAATACX3TPG TCTCAAATK5 CCAACGTITr. 

3051 CGACTTTCCT GATCTTAATC GTmCTTAC TAGGACATTA CAACTICTAC 

3101 TACX:TGATCT TGCTGCXAAA GCAAGCCCTG CAGCnCIGC TCTCATTCGA 

3151 ACTTTAGGAA AACAATTAAA TGTCAATCGT AGAGAGATTT TAATAAACAA 

3201 CTTCAAATAT ATTmTCTC ATTTGGTCTG TICITCITCC AAAGATCAAT 

3251 TAGAAOGTCC OCTTCATTAT CTGAAGAATG AAACAGAAAT TCAACTCQGG 

3301 AGCCTGTTGA GACAAGATTT CCAAGGATTG CATAATCAAT TA1TCCIX3CG 

3351 TATTQGAGAA CACTATX::aAC AGGnTTTAA TCGITTCTCA ATACITCCCT 

3401 CATTTGCATC CAGTGATGAT OCATATCAGG GCCCGAGAGA TATCATATCA 

3451 CXTOAACTGA TGGCTGATTA TTTACAACCC AAATTCITCG GCATTTIGGC 

3501 TTTrTTTAAC ATOCAGTEAC TGAGCTCTAG TGTTGQCATr GAAGATAAGA 

3551 AAATCGOCIT GAACAGTITG ATGICITTGA TCAAGTTAAT GGGACXTAAA 

3601 CATSTCAGTT CTCTGAQGGT GAAGATGATG ACCACACTGA GAACTOGOCT 

3651 TCGATICAAG GATCATTITC CTGAATTGTG ITGCAGAGCT TCQGACTOCT 

3701 TTOTTOGCIG CCTQGATCAT GCTTGTCTOG GCTCCCITCT CACJICATOTA 

3751 ATAGTAGCTT TX31TA0CTCT TATACACATC CAGCCTAAAG AAACTOCAGC 

3801 TATCTTOCAC TACCTCATAA TTGAAAACAG QC3ATGCIGTO CAAGATITIC 

3851 TTCATGAAAT ATATTmTA CXTIGATCATC CAGAAOTAAA AAAGATAAAA 

3901 GOOGITCTCC AGGAATACAG AAAGGAGACC TCIGAGAGCA CTCATCTTCA 

3951 GACAACTCrr CAGCICICTA OGAAQQCCAT TCAACATCAA AATCTOGATC 

4001 TTOSTATICA TOCICTTACA AGCITCAAGG AAAOCTTOTA TAAAAATCAG 

4051 GAAAAACTOA TAAAGTATGC AACAGACAGT GAAACAGTAG AACCTATTAT 

4101 CTCACAGTT5 GTGACAGIGC TriTGAAAQG TTQCXAAGAT QCAAACTCIC 

4151 AAOCT00C3TT GCTCiglWU GAATCITEAG GOGAATTOOG GCSOSATAGAT 
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Figure 8b (Iv) 

4201 CCAGGTCGAT TAGATTTCTC AACAACTGAA ACTCAAGGAA AAGATTITAC 
4251 ATrrGTGACT GGAGTAGAAG ATTCAAGCTT TGCXTTATCQA TTATTOATOG 
4301 AGCTAACAAG AGCTTACXnT GCGTAOGCTG ATAATAGCXX3 AGCTCAAGAT 
4351 TCAGCTCCCT ATOCCATTCA GGAGTTGCTT TCTATTTATG ACTGTAGAGA 
4401 GATGGAGACC AACGGCCCAG GTCACCAATT GTGGAGGAGA TITCCroAGC 
4451 ATX3TTCGGGA AATACTAGAA CCTCATCTAA ATACCAGATA CAAGAGTTCT 
4501 CAGAAGTCAA CCGATTGGTC TCGAGTAAAG AAQCX::aATTT ACTTAAGTAA 
4551 ATrGGGTAGT AAClTrGCAG AATGGTCAGC ATCTTOGGCA GGrTATCTTA 
4601 TTACAAAGGT TCGACATGAT CTTCCCAGTA AAATTTTCAC CTCCTOTAGC 
4651 ATTATGAOGA AGCATGATTT CAAAGTGACC ATCTAOCTTC TTCCACATAT 
4701 TCTQGTCTAT GTCTTACTGG GITCTAATCA AGAAGATCAG CAQGAGGTIT 
4751 ATGCAGAAAT TATGGCAGTT CTAAAGCATC AOGATCAGCA TACCATAAAT 
4801 AOOCAAGACA TTGCATCTGA TCTXSXGTCAA CTCAGTACAC AGACroiGrr 
•4851 CTCCATCCTT GACCATCTCA CACAGTOGGC AAQGCACAAA TTICAGGCAC 
4901 TGAAAGCTGA GAAATGTOCA CACAGCAAAT CAAACAGAAA TAAGGTAGAC 
4951 TCAAOGGTAT CTACTGTQGA TTATGAAGAC TATCAGAGTG TAACCCGTTT 
5001 TCTAGAOCTC ATACCCCAGG ATACTCTQGC AGTAGCTTCC TITOGCICCA 
5051 AAGCATACAC AOGAGCTGTA ATQCACTITG AATCATTEAT TACAGAAAAG 
5101 AAGCAAAATA TTCAGGAACA TCTTOGATTT TTACAGAAAT TGTATOCIGC 
5151 TATOCATGAA C3CTGATOGAG TQGOOQGAGT CAGPTOCAATT AGAAAGGCAG 
5201 AAOCATCICT AAAAGAACAG ATCCITGAAC ATCAAAGOCT TOQCTTGCXG 
5251 AGQGATOOCA CrSCTTGTTA TGACA0C3GCT ATICAQCTAG AACCAGAOCA 
5301 GATCATICAT TATCATOGTC TAGTAAAGIC CMCTEAQOT CTrGGICftQC 
5351 TSXCTACTOT TATCACICAG GTGAATGGAG TCCMGCTAA GAa?ICaGAG 
5401 T9GACAGATG AATTAAACAC GTACAGACTG GAAGCAQCTT QGAAATIGIC 
5451 ACAGIOQCSAT TTOCTOCSAAA ACTATPEGQC AGCAGAOOG^ AAAITCIACAA 
5501 GATOGAGIGT CAGACTSGGA CAGCTRTEAT TA!rCAQCX3A AAAAAGAGAT 
5551 ATCACAGCTT TTEATCACIC ACTOAAACTA GIGAGAQCaG AACAAATICT 
5601 AOCTCmCA GCTGCAAGCT TTGAAAGAGG CTOCTACSCAA CX3AGGATOTC 
5651 AATATATIGT GASATTOCAC ATOTTATOTG ACTTQGAGCA TAOCATCAAA 

5701 ocAcrmoc AGCATTCTCC aostoacagt tctcaagaag attctctaaa 

5751 CroOGTAGCT CGACTAGAAA TGAOOCAGAA TTOCTACAGA QOCAAOaAOC 
5801 ClATOCroOC TCTOCX9GAOG GCTTTACTAA OCSCKIAACAA AAGAOCAGAT 
5851 TACAAIGAAA TOGTIGSAGA ATCSCTOQCIG GAGAOZOC9CA GC9CTAOC33VG 
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Figure 8b (v) 

5901 AAASGCTC3GT CACXZACCAGA CAGCCTACAA TCCTCTCCIT AATX3CAGGGG 

5951 AATCACGACT OGCTGAACTG TACGTOGAAA GGGCAAAGTG GCTC TO GTCC 

6001 AAGGGTGATG TTCACXy^QGC ACTAATTGTT CTTCAAAAAG GTCTTCAATT 

6051 ATGTTTTCCT GAAAATGAAA CXXTACCTGA GGGTAAGAAC ATCTTAATCC 

6101 ATOGTCGAGC TATSCTACTA GTCGGCCGAT TTATQGAAGA AACAGCTAAC 

6151 TTTSAAAGCA ATGCAATTAT GAAAAAATAT AAGGATGTGA CCGCGTGCCT 

6201 GCCAGAATCG GAGGATGQGC ATnTTACCT TGCCAAGTAC TATGACAAAT 

6251 TGATGCXXIAT GGTCACAGAC AACAAAATGG AAAAGCAAGG TSATCTCATC 

6301 CXOTATATAG TTCrrCATTT TCGCAGATCT CTACAATATG GAAATCAGTT 

6351 CATATATCAG TCAATGCX:AC GAATCTTAAC TCTATQGCTT GATTATCGTA 

6401 CAAAGGCATA tgaatgggaa aaagctgqcc gctccgatcg tgtacaaatc 

6451 aggaatsatt tgggtaaaat aaacaaggtt atcacagagc atacaaacta 

6501 tttagctcca tatcaatttt tgactgcttt ttcacaatrg atctctcgaa 

6551 tttgtcatrc tcacgatcaa gtmtgttg tcttgatoga aataatagcc 

6601 AAAGTAITTC TAGCCTATCC TCAACAAGCA ATGTGGATGA TGACAQCTOT 

6651 GTCAAAGTCA TCTTATCCXA TGCX3TGTGAA CAGATGCAAG GAAATCCTCA 

6701 ATAAAGCTAT TCATATGAAA AAATCCTTAG AGAAGTnGT TGGAGATGCA 

6751 ACTCGOCTAA CAGATAAGCT TCTAGAATTG TGCAATAAAC CGGTTCATGG 

6801 AAGTAGTTCC ACATTAAQCA TCAGCACTCA TTTTAAAATG CTTAAAAAGC 

6851 TOGTAGAAGA AQCAACATTT AGTCAAATCC TCATTOCTCT ACAATCAGTC 

6901 ATGATACXrrA CACTTOCATC AATTCTGGGT ACCCATGCTA ACCATGCTAG 

6951 OCATGAAOCA TTTCCTQGAC ATIGGGCX^TA TATTCCAGQG ITTGATGATA 

7001 TQC31X3GAAAT TCTTGCrTCT CTTCAGAAAC CAAAGAAGAT TICTTTAAAA 

7051 QQCTCAGATS GAAAGTTCTA CATCATGATG TGTAAGOCAA AAGATGACCT 

7101 GAGAAAQGAT TGflMACTAA TQGAATICAA TrCCTTGATT AATAAGTGCT 

7151 TAAGAAAAGA OOCAGAGTCT OGTAGAAGAG AACTTCATAT TOGAACATAT 

7201 QCftGnATTC OWnAAATQA TCAATOIGQG ATTATTGAAT GQGTGAACAA 

7251 CACroCTQGT TTGAGAOCTA TICIGAOCAA ACTATATAAA GAAAAGGGAG 

7301 TOEATAOXaVC AGGAAAAGAA CTTOGOCAGT GIATQCTACC AAAGTCAQCA 

7351 GCTXTATCTG AAAAACTCAA AGTATTGOGA GAATnCTOC TCOCCAGGCA 

7401 TCCTOCTATT TTICATCAGT GGTriCTGAG AACATTCOCT GATOCTACAT 

7451 CATOGTACAG TAGTAGATCA GCTTACTQCX: GTTOCACTGC AGIAA7G1CA 

7501 ATOOTTOGTr ATATICIQGG GCTTOQAGAC OCnCATOGTO AAAATATTCT 
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Figure 8b (vl) 

7551 crrrGArrcT rrGACTGGTG 

7601 TCAATAAGGG AGAAACCTTT 

7651 ACTCATAATA TGGTTAATC3G 

7701 TCGAAGAGCA TGTGAAGTTA 

7751 CTTTAATGAG TGTCTTAAAG 

7801 AGTAAACCAG TGAAAGGGCA 

7851 AGTTCTCAAT GAAAAGGCCA 

7901 TACAAGGTCT AATCAAGACT 

7951 ATTGAAGGAC ATGTGCATrA 

8001 ACTATGCCAG ATGTATCTTG 

8051 TGTAAAAGAA TATGTTAATA 

8101 GTQGTTGTAT CTGTTCAATT 

8151 CAACTGTTAT TTCTCTCTGA 

8201 CACTCXTIGCC 



AATGCGTACA TGTAGATTTC AATTGTCnT 
GAAGTTCCAG AAATTGTGCC PiVTTCGCCTG 
AATGGGTCCT ATGGGAACAG AGGGTCTTTT 
CAATGAGGCT GATGCGTGAT CAGCGAGAGC 

AcnrrcTAC atgatcctct tctggaatcg 

TTCCAAAGCG CCACTGAATG AAACTGGAGA 
AGACXCATGT TCTTGACATT GAGCAGCGAC 
CGAAATAGAG TGACAGGACT GCCGTTATCT 
CCTTATACAG GAAGCTACTG ATGAAAACTT 
GTTGGACTCC ATATATGTGA AATGAAATTA 
ATCTAAAAGT AATCCATTTG GTATGAATCT 
CTAAAGTACA ACATAAATTT ACGTTCTCAG 
TCATTAATTA TATGTAAAAT AATATACATT 
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! 'NA^SBCXJnrcE 1.0 

HS349941 standard; RNA; HUM; 12780 BP. 

AC U34994; 
XX 

NI g995940 
XX 

ur 26-SEP-1995 (Rel. 45, Created) 

22.FEB-1997 (Rel. 51, Last updated. Version 4) 

DE ST ^?:?rcS' "^'"^^^ ^^'^ ^^-PKcs, 

XX 

KW 

XX 

OS Homo sapiens (human) 

CX: EuJcaryota; Metazoa; Chordata; Vertebi-ah^ . m=.«,^=i;, t:^ 

OC Catarrhini; Hcaninidae; Homo ^^^^ebrata, Mairroalia; Eutherxa; Primates;- 
XX 

RN (U 

RP 1-12780 

RX MEDLINE; 95401275. 

RL Cell 82:849-856(1995). 
XX 

FN [2] 

RP 1-12780 

RA Cell D.; 
RT 

S: i^ri?? <29-AUG-1995) to the EMBL/GenBank/DDBJ databases 

Dave Gell. Zoology. Wellcome/CRC, Tennis Court Road, bridge CB2 IQR. 

XX 

ER SPTOEMBL; Q13327; Q13327. 

OC NCBl gi: 995940 
XX 

5? ^ Location/Qualifiers 

FH 

FT source i. .12780 

f]E /organisms "Homo sapiens* 

/chranoscme= ■ 8 ■ 
^ /cell_type=- He-La- 

^ /inaps="8Qll" 

^ CDS 53. .12343 

FT /codocustart=l 

li /dbjcrefe-piD:g995941" 

li /db^Xref s • SPTREMBL : Q13 327- 

FT iS^^r^Tt''^^'^^!^ protein kinase catalytic subunit; PI 
FT 1. f^^^ inanber; partial genomic sequence located in 

FT G«Bank /U:cession^l^ L27425; MethodPcOTc^t^ 

FT 

FT 



/f^f-r^^.^f^"^^^ protein )cinase catalytic subunit" 



FT 
FT 
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KWKvmra 

^AKTEFWVYSFSYELIUJSTRLPLISGFYKIXSITVRNAKKIKYF^ 

MAFKU3LSYTPIAEVGIJaALEIWSIYIDRHVMQPYYKDILPCUXm-K^^ 

^^f^S^^'^'^'^^^'^^^^PVIFUJVnT.PRVT^ 




v» 1 wu. Y:>iJU.HPNAFKJU^ASlAFmiYREFREEESLVEOFWEALW^ 
MESIALAHADEKSUn-IQQCCmiDHI^IIEKKHVSIJ^KASLPraFre 
DUJiajLI^CGRPQTBCRHKSIELFYKF./PLL^ 

GGGOC^PSGIIAQPTLLYLRGPFSIXJATIOTLDUJAALECVIOT 

MKAlJ<MSPYKDILErimREKITAQSIEELCAVNLYGPnAOVDROTA^ 

GUJlNILPSOSTDUfflSVGTEIJ^LVYKGIAPGDERQCLpSsraOI^ 
PGGLCE3U>VSLLLNPA\rt.STA«;i/5RQrif^«nrr«c.cur.^3i:^SiZ^^ 




FFTSLTGGSLEELRRVLEQLIVAHFPMQSH£FPPGTPRF^?5^^;^2S 
M^EUfTEVUrREQQHVMEELFQSSFRRlARRGSOmjV^^sS^^ 
ROSFVDRSIiTIXWHCSU)AIJ^STIVVnAIDVUCS 
KIIXWHYSRLPKDDVHAKESKINQVFHGSCITBGNaTOTWn^ 
LERRRLVHCAAYNCAISVICCWWEUCFVOGFLFSEKPHoi^ 
EVEVPMERKKKYIEIRKEAREAANGDSDGPSVMSSI^yS^^SSS^ 



ysyssqdprpatgrfrrreqrdptvhddvlelemdelnrS^^^SS^ 

PQGEEDSVPRDLPSWMKFUlGKI/aaPIWUaRIJLAKIA^^ 

YDPQCGIOSSEYFQALVNNMSFVRYKEVYAAAAEVLGLIUIYVMERI^^ 

BarreiYFQLKSKDFVQV^^ 
STraiE(»IYm:uiV«HnmU)PESETKJDSOETFWA 



STTO^QMYOTLMWIHD^ 
KKRl/n-PGDEVimVKGAAGRTDIJJUJyWIMRi» 



SCKF^fiM^mSARKON^ 

imwmADK3>MUyCEEENAS\m3SAELQAYPALVVEI^^ 

AIVYPFIISSESYSFFOTrj^ITSnfMKTrCTraijTircn^ twv^ J^rJS,™,- Z^ZZJl 




i^^SS^ii:^'^^^"^S^'^'^SDPRAPPa^^ 

INV»Emm'RQKICYAKRKlAGAm>A\aTCDEUJU3HEKAPAF^ 
Sequence 12780 BP, 3612 A; 2769 C; 3084 G; 3314 T; 1 other; 
4994 Lengths 12780 July 10. 1998 12:15 Type: N Check: 8189 ..Figure 9b(l) 
1 ATTTCOOQOT O0QC3GO0GAG OQQOOCSCaCO OtSOQGGAGOS 0QR01C0008 
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51 


GCATGGCGGG 


CTCCGGAGCC 


GGTC3TCCGTT 


GCTCCCTCCT GCGGCTGCAG 


101 


GAGACCTTGT 


CCGCTGCGGA 


CCX3CTCCQGT GCTGCCCTGG CCGGTCATCA 


151 


ACTGATCXX3C 


GGCCTGGGGC 


AGGAATGCGT 


CCTGAGCAGC AGCCC0GCX3G 


201 


TGCTGGCATT 


ACAGACATCT 


TTAGTTnTT 


CCAGAGATTT CXlXiTnXSClT 


251 


GTATITCTCC 


GGAAGTCACT 


CAACAGTATT 


GAATTTCGTG AATCTAGAGA 


301 


AGAAATCCTA 


AAGTTTTTAT 


GTA'lTl'lCnT 


AGAAAAAATG GGCCAGAAGA 


351 


TCGCACCTTA 


CTCTGTTGAA 


ATTAAGAACA 


CTIGTACCAG TGTTTATACA 


401 


AAAGATAGAG 


CTGCTAAATG 


TAAAATTCCA 


GCCCTQGACC TTCTTATTAA 


451 


GTTACrrCAG 


ACTTTTAGAA 


GTTCTAGACT 


CATQGATGAA TTTAAAATTG 


501 


GAGAATTATT 


TAGTAAATIC 


TATGGAGAAC 


TTGCATTGAA AAAAAAAATA 


551 


CXIAGATACAG 


TTTTAGAAAA 


AGTATATGAG 


CTCCTAGGAT TATTGGGTGA 


601 


AGTTCATCCT 


AGTGAGATGA 


TAAATAATGC 


AGAAAACCTG ITCCGCGCTT 


651 


TTCTQQGTGA 


ACTTAAGACC 


CAGATGACAT 


CAGCAGTAAG AGAGCCCAAA 


701 


CTACCTCTTC 


TQGCAGGATG 


TCTGAAGGGG 


TTGTCCTCAC TTCTGTGCAA 


751 


CTTCACTAAG 


TCCATGGAAG 


AAGATCCCCA 


GACTTCAAGG GAGATTnTA 


801 


ATTTTGTACT 


AAAGGCAATT 


CGTCCTCAGA 


TTGATCTGAA GAGATATGCT 


851 


GTGCCCTCAG 


CTQGCTTOCG 


CCTATTTGCC 


CTGCATGCAT CTCAGTTTAG 


901 


CAOCTGCCTT 


CTGGACAACT 


ACGTCTCTCT 


ATTTGAAGTC TTGTTAAAGT 


951 


GGTSTCSCXXyv 


CACAAATCTA 


GAATTGAAAA 


AAGCTGCACT TTCAGCCCTG 


1001 


GAATCCITTC 


TGAAACAQGT 


1TCTAATAT6 


GTGGCGAAAA ATGCAGAAAT 


1051 


GCATAAAAAT 


AAACTGCAGT 


ACTITATGGA 


GCAGTITTAT GGAATCATCA 


1101 


GAAATGTG6A 


TTOGAACAAC 


AAGGA5TTAT 


CTATTGCZAT OOGTOGAIAT 


1151 


GGAcmnG 


CAOGACXXnXS 


CAAGGTTATA AAOGCAAAAG ATOTTGACTT 


1201 


CATGTACXalT 


GAGCTCATTC 


AGOGCTQCAA GCAGATGITC CTCACOCAGA 


1251 


CAGACACTGG 


TGACTACOGT 


GITTATCAGA 


TGCX^AAGCFT GCTOCAGXCT 


1301 


GT3X3CAAGCG 


TCTTGCTGTA CCTTGACACA GTTCC3XSAGG TSIATACTCC 


1351 


AG'i'lC'lUGftG 


CACCTCGTOG TCATCCAGAT AGACAGTTTC 


1401 


GTOCAAAAAT 


GCAGCIGGTG 


TGTTGCAQAG 0CATAGTX3AA GGTSTTOCTA 


1451 


GCTTTOGCAG 


CAAAAOGGCC AGTTCTCAG6 AATTGCATTA GTACTCTQGT 


1501 


GCATCAOGGT 


TTAATCAGAA TAICTICTAA ACCAGTGGTC CTP0CAAA06 


1551 


GOCXTGAGTC 


TGAATCTGAA GAOCAOOGTG 


CTTCAOGGGA AGTCAGAACT 


1601 


GGCAAATQGA 


AOGIX300CAC 


ATACAAAGAC 


TA0GTOC3ATC TCTTCAGACA 


1651 


TCTOCTGAOC 


TCTC3A0CAGA TGATOGATTC TATTITAOCA GATGAAOCAT 
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1701 TmCTCTCT GAATTCCTCC AOTCAAACSTC TCAATCATTT ACTTTATCAT 
■ 1751 GAArrTGTAA AATCCCTmT GAAGATTOTT GAGAAATIOG AlCTTACACT 
1801 TGAAATACAG ACTCTTGGGG AACAAGAGAA TOGAGATCAG GCGCCTOCIC 
1851 TTTGGATGAT CCCAACTTCA GATCCAGOK CTAACITCCA TCCAGCTAAA 
1901 CCTAAAGATT TITCGGCTrT CATTAACCTG GTOGAATITr GCAGAGAGAT 
1951 TCTCCCTCAG AAACAAGCAG AATTrTITCA ACCATOQGTO TACICATTIT 
2001 CATATCAATT AATTTTCCAA TCTACAAGCT TCCCCCICAT CAGIXSCOTTC 
2051 TACAAATTOC TTTCTATTAC AGTAAGAAAT GCCAAGAAAA TAAAATATTT 
2101 CGAGGGAGTT AC3TCCAAAGA GTCTGAAACA CTCTCCTOAA GACCCAGAAA 
2151 AGTATICTrG CirrGCTITA TTTOTGAAAT TTCQCAAAGA GGTOQCAGIT 
2201 AAAATCAAGC AGTACAAAGA TCAACTTITC QCCICProrr TCACCTTTCT 
2251 TCTGTOCITC CCACACAACA TCAITCAACT CGATCITOGA QCCTACfflTC 
2301 CTCCACTGCA GATCGOTTC AAACTOQGCC TCAGCTATAC CCCCTTOCSCA 
2351 GAAGTAGQCX: TGAATOCTCT AGAAGAATCG TCAATmTA 1TCACAGACA 
2401 TCTAATCCAG OCTTAITACA AAGACAnCT CCCCTOCCR3 GATOGATACC 
2451 TCAA(3ACrTC AGOOTGTCA GATGA(3A0CA AGAATAACTO GGAAGTOICA 
2501 GCTCTTTCTC QGQOTOCOCA GAAAGQATTT AATAAAGTOG TOITAAAGCA 
2551 TCTGAAGAAG ACAAAGAACT TTTCATCAAA CGAAGCAATA TCCmOAAG 
2601 AAATAAGAAT TAGAGTAGIA CAAATOCTTG GAOCTCTAGS AGGACAAATA 
2651 AACAAAAATC TTCTGACAGT CAOGTCCTCA GATSAGATCA TCAAGAGCTA 
2701 TSTGGOCTQG GACAGAGAGA AGOOGCTOAG CTITOCAGIG CCCTmGftG 
2751 AGATGAAACC TGTCATnTC CTQGATSIGT TOCTCOCICG AGTCACAGAA 
2801 TTAGCGCTCA CaOOCAGTOA CAGACAAACT AAAGITCCaG OCTOroAACT 
28S1 rmCATAGC ATQGTEATST TTATCITOQG CAAAGCCACG CAGATOCCRG 
2901 AAGQGGGACA GOGAGCXXXA CCCKKSEMX AGCICIATRA GOOGACJBXTT 
2951 CCTGTCCIGC TTOGACTTGC CTGTGATSrr GATCAGSIGA CAAGQCAACT 
3001 CTATGAGOCA CEAGTOATOC AGCTOATICA CTQGTKaCP AACaACAABA 
3051 AATTTGAAAG TCAGGATRCT GITIOCXTAC TAGAAGCTAT A1TOQATOGA 
3101 ATTOTQGACC CTGITGACAG TiACITEAAQA GATmTOIG GiaXSlBlXr 
3151 TOGAGAATTC CTEAAATQCT OCATEAAGCA AAaaACAOCA CAGCaocaOG 
3201 AGAAGAGTCC ACTAAACAOC AAATOQCm" TCAAQOGACT TTATAGCXaT 
3251 GOGCTTCACC CCAATCCTTT CAAGAGGCIG GGftGCMCAC TTOOCTmA 
3301 TAATATCTAC AQQGAATTCA OOGAAGAAGA GlClC' lWiU QAACAOriTC 
3351 TOirrGAAGC CITOCTSATA TACATSSRGA GlCiC)Ua. Vr . AOCaCATOCA 
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Figure 9b (Iv) 

3401 GATGAGAAGT CCTTAQGTAC AATTCAACAG TGITGTCATG CCATIXSATCA 

3451 CCTATGCCGC ATCATTGAAA AGAAGCATGT TTCTTTAAAT AAAGCAAAGA 

3501 AACGACGTTT GCCGCGAQGA TTTCCACCTT CCGCATCATT GTCTTTATTC 

3551 GATCTOGTCA AGTGGCrnT AGCTCATTGT GGGAGGCCCC AGACAGAATC 

3601 TCGACACAAA TCCATIGAAC TCTTTTATAA ATTCGTTCCT TTATTCCCAG 

3651 GCAACAGATC CCCTAATTTG TGGCTGAAAG ATGTTCTCAA GGAAGAAGGT 

3701 GTCTCTTTTC TCATCAACAC CTTTGAGGGG GGTGGCTCTG GCCAGCCCTC 

3751 GGGCATCCTG GCCCAGCCCA CCCTCTTGTA CCTICGGGGG CCATTCAGCC 

3801 TGCAGGCCAC GCTATGCTGG CTCGACCTGC TCCTCGCCGC GITCGAGTCC 

3851 TACAACACGT TCATTC3GCGA GAGAACTGTA GGAGCGCTCC AGGTCCTAGG 

3901 TACTGAAGCC CAGTCTTCAC TTTTGAAAGC AGTGGCTTTC TTCTTAGAAA 

3951 GCA7TGCCAT GCATGACATT ATAGCAGCAG AAAAGTCCTT TCGCACTCGG 

4001 GCAGCAQGTA ACAGAACAAG CXXIACAAGAG GGAGAAAGGT ACAACTACAG 

4051 CAAATGCACC GTTGTGGTCC GGATTATGGA GTITACCACG ACTCTCCTAA 

4101 ACACCTCCCC GGAAGGATGG AAGCTCCTCA AGAAGGACTT GTOTAATACA 

4151 CACX:TGATGA GAGTCCTGGT GCAGA03CTG TGTGAGCCCG CAAGCATAGG 

4201 tttcaacatc ggagacgtcc aggttatggc tcatcttcct gatctttctc 

4251 tgaatctgat gaaagctcta aagatgtccc catacaaaga tatxixttagag 

4301 accx:atctga gagagaaaat aacagcacag agcattgagg agctttctcc 

4351 cxncaacttg tatggccctg acgcgcaagt qgacagc3agc aggctogcix3 

4401 ctcntgtgtc tgcctgtaaa cagcttcaca gaqctogqct tcix3cataat 

4451 atattaccgt ctcagtccac agatitqcat cattcigttc qcacagaact 

4501 tctttccctg ctttataaag gcattgcccc tqgagatcag agacagictc 

4551 tgocttcrict agacxttcagt tgtaagcagc tqgccagcgg acitctogag 

4601 TTAGormu crrrroGAGG actgtgtgag ogccttctga G'xcTxtj'ixjc r 

4651 GAAOOCAQOG GTOCTOIXXA CGGCGTCCTT GGQCAGCTCA CAGGQGAQCG 

4701 TCATOCACTT CICCCATQQG GAGTATTTCT ATAGCTTGTT CTCAGAAACG 

4751 ATCAACACQG AATTATTGAA AAATCTSGAT CTTGCTCTAT TCGAGCICAT 

4801 GCAGTCTICA GTOGATAATA CCAAAATGGT GACTGCCCnT TTCAACXX3CA 

4851 TGTTAGACCA GAGCTTCAGG GAQCGAGCAA ACCAGAAACA CCAAGGACTC 

4901 AAACTTSCGA CTACAATICT GCAACACTOG AAGAAGICTG ATICATGCTG 

4951 QQOCAAAGAT TCOCXITCTOG AAACTAAAAT GGCAGTGCK5 GCXTTACTOG 

5001 CAAAAAirrT ACAGATTGAT TCATCTCTAT CTTTTAATAC AACTCATQC5T 
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Figure 9b (v) 

5051 TCATTCCCTG AAGTCTTTAC AACATATATT AGTCTACTTG CTGACACAAA 

5101 GCTOGATCTA CATTTAAAGG GCCAAGCTGT CACTCTICTT CCATTCTTCA 

5151 CCAGCCTCAC TGGAGGCAGT CTGGAQGAAC TrAGACGTGT TCTCGAGCAG 

5201 CTCATCGTTG CTCACTTCCC CATGCAGTCC AGGGAATTTC CTCCAGGAAC 

5251 TCXXXXSGTrC AATAATTATC TGGACTGCAT GAAAAAGTTT CTAGATGCAT 

5301 TCGAATTATC TCAAAGCCXT ATGTTCTIX3G AATTCATGAC AGAAGrTCTT 

5351 TCTOQGGAAC AGCAGCATGT CATCGAAGAA TTATTTCAAT CCAGTITCAG 

5401 GAGGATTGCC AGAAGGGGTT CATGTGTCAC ACAAGTAGGC CTTCTQGAAA 

5451 GCGTGTATGA AATGTTCAGG AAQGATGACX: CCOSCCTAAG TTTCACAOGC 

5501 CAGTCCTTTG TGGACCGCTC CCTCCTCACT CTGCTGTGGC ACTGTAGCCT 

5551 GGATGCTTTG AGAGAATTCT TCAGCACAAT TGTGGTGGAT GCCATTGATG 

5601 TCTTGAAGTC CAGGTITACA AAGCTAAATG AATCTACCTT TGATACTCAA 

5651 ATCACCAAGA AGATGGGCTA CTATAAGATT CTAGACGTGA TGTATTCTCG 

5701 CCTTCCCAAA GATGATCTTC ATGCTAAGGA ATCAAAAATT AATCAAGTTT 

5751 TCCATCGCTC GTGTA1TACA GAAGGAAATC AACTTACAAA GACAITGATT 

5801 AAATTC3TGCT ACGATGCATT TACAGAGAAC ATOGCAGGAG AGAATCAGCT 

5851 GCTGGAGAGG AGAAGACTTT ACCATTGTGC AGCATACAAC TGCGCCATAT 

5901 CTGTCATCTG CTGTGTCTTC AATGAGTTAA AATTTTAOCA AGGTITICTC 

5951 TTTAGTGAAA AACXZAGAAAA GAACTTCCTT ATimGAAA ATOTGATOGA 

6001 CXnGAAGCX3C CGCTATAATT TTCrTGTAGA AGITGAGGTT CXn!ATGGAAA 

6051 GAAAGAAAAA GTACATTGAA ATTAOGAAAG AAOCXIAGAGA AGCAGCAAAT 

6101 QQGGATTCAG ATOGTCCTTC CTATATCICT TCXTTOTCAT ATTTOSCAGA 

6151 CAGTACOCIG AGIGAGGAAA TGAGTCAATT TGATTTCICA ACOGGAGTTC 

6201 AGAGCTATTC ATACAOCTCC CAAGACXX^TA GkOCTGOChC TO CJiXJUnTXT 

6251 CGGAGACX9GG AGCAGOGGGA CCGCAOGCTTC CATGAT6ATG TQCTOGMSCT 

6301 GGAGAOGGAC GAOCTCAATC GGCATSA£?IG CATGGCX30CX: CTGAOQCXXX: 

6351 TGGICAAOCA CATGCACAGA AOOCTQCSGCX: GGCTTCAAOG AGAAGAGGAT 

6401 TCAGTOCCAA GAGATCTIOC TrCTTGGATG AAAITOCTCX: ATOGCAAACT 

6451 GQGAAATOCA ATAGTACCAT TAAATAT006 ' iCiirriU ' lTA GCXaAGCXTG 

6501 TEATTAATAC AGAAGAGGTC TrTOGOCXOT ACGC3GAAGCA C TO SCI TA GC 

6551 COCXTGCTOC AGCIGGCTGC TICTGAAAAC AATGGAGGAG AAOQAATTCA 

6601 CTACATGGTG GTTGAGATAG OGGCXIACTAT TCTTZCATSG ACAGGCTTGG 

6651 CXrACTOCAAC A0GGG7O0CT AAAGAOGAAG TCTTAGCAAA TOSATTGCZT 

6701 AATnCCTTAA TGAAACATOT CTTTCATOCA AAAAGAOCTO TSTTTAGACA 
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Figure 9b (vl> 
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6751 CAACXTTTGAA ATTATAAAGA CCCTTGTCGA GTGCTGGAAG GATTCTTTAT 

6801 CX:ATCCXnTA TAGGTTAATA TTTCAAAAGT rTTCCXSGTAA AGATCCTAAT 

6851 TCTAAAGACA ACTCAGTAGG GATTCAATTG CTAGGCATCG TCATCGCCAA 

6901 TGACCTGCCT CCCTATGACC CACAG1X3TCG CATCCAGAGT AGCGAATACT 

6951 TCCAGGCTTT GGTGAATAAT ATGTCCTITC TAAGATATAA AGAAGTCTAT 

7001 GCCGCTGCAG CAGAAGTTCT AGGACTTATA CTTCGATAIXS TTATCGAGAG 

7051 AAAAAACATA CTGGAGGAGT CTCTGTGTGA ACTGGTraCG AAACAATIX3A 

7101 AGCAACATCA GAATACTATG GAGGACAAGT TTAITCTCTC CTIGAACAAA 

7151 GTGACCAAGA GCTTCCCTCC TCTTGCAGAC AGGTTCATCA ATCCTOItSTT 

7201 CirrCTGCTG CCAAAATTTC ATCGAGTGTT GAAAACACTC TCTCTCGAGG 

7251 TGGTACTTTG TCGTGTGGAG GGAATGACAG AGCTGTACIT CCAGTTAAAG 

7301 AGCAAGGACT TCGTTCAAGT . CATGAGACAT AGAGATGAAA GACAAAAAGT 

7351 ATGTTK3GAC ATAATTTATA AGATGATCCC AAAGTTAAAA CCAGTAGAAC 

7401 TCCXy^GAACT TCTGAACCCC GTTGTGGAAT TCGTITCCCA TCCTTCTACA 

7451 ACATGTAGGG AACAAATGTA TAATATTCTC ATCTCGATIC ATCATAATTA 

7501 CAGAGATCCA GAAAGTGAGA CAGATAATCA CTCCCAGGAA ATATTTAAGT 

7551 TGGCAAAAGA TGTGCTGATT CAAGGA1TGA TCGATCAGAA CCCTCGACIT 

7601 CAATTAATTA TTCGAAATTT CTGGAGCCAT GAAACTAGGT TACCTTCAAA 

7651 TACCTTGGAC CGGTTGCTCG CACTAAATTC CTTATATTCT CCTAAGATAG 

7701 AACTGCACTT TTTAAGITrA GCAACAAATT TTCTGCTOGA AATGACCAGC 

7751 ATGAGCCCAG ATTATCCAAA CXXTATCTTC GAGCATCCIC TCTCAGAATG 

7801 CGAATITCAG GAATATACCA TTGATTCTGA TTOQCGTriC CGAAGTACIG 

7851 TTCTCACICC GATCTITCTG GAGACCCAGG CCTCCCAGGG CACTCICCAG 

7901 ACCXX3TACCC AGGAAGGGTC CCTCTCAGCT 0GCTCGCX::AG TGGCAGGGCA 

7951 GATAAGQGCC AOXAGCAGC AGCATCACTT CACACTGACA CAGACTOCAG 

8001 ATOGAAGAAG CTCATTTGAT TOGCTGACCG GGAGCAGCAC TGACCCX3CIX3 

8051 CTOGAOCACA CCAGTCCCTC ATCTGACIXX ri \3Cl\y m U CCCACAAGAG 

8101 GAGTGAAAGG TTACAGAGAG CAQCCITGAA GTCAGTOQGG CCTCATTITC 

8151 GGAAAAAAAG GCTOGGCCTT CXIAGGGGACG AGGTGGATAA CAAA(?IX5AAA 

8201 GGTGCGGCCG GCCGGACGGA CCTACTACXSA CTQCGCAGAC GGTTTATCAG 

8251 GGACCAGGAG AAGCTCAGIT TGAOGTAOGC CAGAAAAGGC GTTGCTSAGC 

8301 AAAAAOSAGA GAAGGAAATC AAGAGTCSAGT TAAAAATGAA GCAQGATOOC 

8351 CA0C3TOGT1C TQTACAGAAG CTACXX3GCAC OGAGAOC?nx: CTGACAOTCA 
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Figure 9b (vil) 

8401 GATCAAGCAC AGCAGCCTCA TCACCCCGTT ACAGGCOnG GCCCAGAGGG 

8451 ACCCAATAAT TGCAAAACAG CTCTTTAGCA GCTTGrmC TOGAATTITG 

8501 AAAGAGATGG ATAAATTTAA GACACTGTCT GAAAAAAACA ACATCACTCA 

8551 AAAGTTGCTT CAAGACTTCA ATCGTITrCT TAATAOCACC TTCTCTTTCT 

8601 TTCCACCCTT TGTCTCTTGT ATTCAGGACA TTAGCTGTCA GCACGCAGCC 

8651 CTGCTGAGCC TCGACCCAGC GGCTGTTAGC GCTGGTTGCC TGGCCAGCCT 

8701 ACAGCAGCCC GTGGGCATCC GCXTTGCTAGA GGAGGCTCTG CTCCGOCTGC 

8751 TGCCTGCTGA GCTGCCTGCC AAGCGAGTCC GTQGGAAGGC CCGCCTCCCT 

8801 CXnGATGTCC TCAGATGQGT GGAGCTTGCT AAGCTGTATA GATCAATTGG 

8851 AGAATACGAC GTCCTCCGTG GGATTTTTAC CAGTGAGATA GGAACAAAGC 

8901 AAATCACTCA GAGTGCATTA TTAGCAGAAG CXAGAAGTGA TTATTCTGAA 

8951 GCTGCTAAGC AGTATGATGA GGCTCTCAAT AAACAAGACT GQGTAGATSG 

9001 TGAGCCCACA GAAGCCGAGA AGGATTTTTG GGAACTTGCA TCCCTIGACT 

9051 GTTACAACCA CCTTGCTGAG TCGAAATCAC TTGAATACTG TTCTACAGCC 

9101 AGTATAGACA GTGAGAACXX: CCCAGACXTA AATAAAATCT GGAGTGAACX 

9151 ATTTTATCAG GAAACATATC TACCTTACAT GATCCQCAGC AAGCTGAAGC 

9201 TGCTGCTCCA GGGAGAGGCT GACCAGTOCC TGCTGACATT TATTGACAAA 

9251 GCTATOCAOG GGGAGCTOCA GAAGGOGATT CTAGAGCTTC AITACAGTCA 

.9301 AGAGCTGAGT CTGCTTTACC TCCTGCAAGA TGATCTTGAC AGAGOCAAAT 

.9351 ATTACATTCA AAATQGCATT CAGAGITITA TGCAGAATTA TTCTAGTATT 

. 9401 GATGTOCTCT TACAtXAAAG TAGACTCACX: AAATTQCAGT CTGTACAGQC 

9451 TmACAGAA ATTCAOGAGT TCATCAGCIT TATAAGCAAA CAAGGCAATT 

9501 TATCATCTCA AGTTCOCCIT AAGAGACTTC TGAACA0C7G GACAAACAGA 

9551 TATCXIAGATG CTAAAATGGA CXX:AATSAAC ATCTGGSATG ACATCATCAC 

9601 AAATOGATCT TTCTTTCTCA GCAAAATAGA GGAGAAGCTT ACCXXnCTTC 

9651 CAGAAGATAA TAGTAT8AAT GTGGATCAAG ATGGAGACXr CAGTSACAGG 

9701 ATOGAAGIGC AA6AGCAGGA AGAAGATATC AG C 1CC 3 CTGA TCAOGAGTTG 

9751 CAAGirmx; ATGAAAATGA AGAOGATAGA CAGIGCXXX3G AAGCAGAACA 

9801 ATTICTCACT TOCTATGAAA CTACTQAAGG AGCTGCATAA AGAGTCAAAA 

9851 ACCAGAGAOG ATTGGCIXSGT GAGCTGGGTG CAGAGCTACT GOCX30CIX3AG 

9901 0CACTGCXX3G AGOqGGICCC AGGGCTGCTC TGAGCAGGIG CTCACTCTOC 

9951 TGAAAACAGT CTCTTTOTTG GATGAGAACA ACGTGTCAAG CTACTTAARC 

10001 AAAAAOATTC TQGCTITOOG TGAOCAGAAC ATTCTCTTGG GTACAACTTA 

10051 CAGGAOCATA GCGAATGCTC OCAOCAGIGA GOCAGOCTGC dTGdGAAA 
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10101 TOGAGGAGGA CAAGGCTAGA AGAATCTrAG AGCTTTCTCG ATCCACTICA 
10151 GAGGATTCAG AGAAGGTGAT OGCGGGTCTG TACCAGAGAG CATTCCAGCA 
10201 CCTCTCTGAG GCTGTQCAGG OGGCTGAGGA GGAGGCCCAG CCTCCCTCCT 
10251 GGAGCTGTGG GCCTGCAGCT GGQGTGATTG ATOCTTACAT GACGCTOQCA 
10301 GATTICTCTG ACCAACAGCT GCGCAAGGAG GAAGAGAATC CATCAGTTAC 
10351 TGATTCTGCA GAACTGCAGG CX3TATCCAGC ACITOTOGre GAGAAAATCT 
10401 TGAAAGCTTT AAAATTAAAT TCCAATGAAG CCAGATTOAA GWTCCTAGA 
10451 TTACTTCAGA TTATAGAAOG CTATCCAQAG GAGACTITCA GCCTCATCAC 
10501 AAAAGAGATC TCTrCOGTTC CCTGCTQQC3V GnCATCAGC TCGATCAGCC 
10551 ACATGOTQGC CTTACIGGAC AAAGAC3CAAG COCTIGCTOT 'TCAGCACTCT 
• 10601 CTQGAA6AAA TCACTGATAA CTACOOGCAG GCTATTOTIT ATCCCTTCAT 
10651 CATAAGCAGC GAAAGCTATT CCTTCAAGGA TACITCT 
10701 ATAAGGAGIT TGTOXS^ ATKVAAAOTA AGT^ 
10751 ATTCAAGATT TTATTAATGC CTTAGATCAG CICICTAATC CrcAACTOCT 
10801 CITBUM3GAT TGGAGCAATG ATGTAAQAGC TCAAC^^ 
10851 TAAATAAAAA AAACATTGAA AAAATOTATG AAAGAATCTA TGCAGCCITC 
10901 OGTCSACCCAA AGGCTCCAGG CCTQGQC5QCC TTTAGAAGGA AGTITATICA 
10951 GACmTOGA AAAGAATTTO ATAAACATTT TQGSAAAGGA QGITCTAAAC 
11001 TACTGAGAAT GAAGCICAGT GACITCAACG ACATEACCAA CATCCIACTT 
11051 TCAAAAATSA ACAAAGACTC AAAQCOOCTr QQGAATCIGA AAGAATCITC 
11101 ACCCTQGATC AOOGACnCA AAOTQGAOTT CCTOAGAAAT GAGCTOC5AGA 
11151 TICCOQSTCA GTATGAOQCT AQQOGAAAGC CATTOCCaGA QTACCaOGro 
11201 OGAATOSCOG GSTTTGAOGA GOOSGTSACA CTCATOGCXST CTCTOOGAAG 
11251 OCGCaAOOGC ATCATCATOC OTOGCCATCA CGAGAQOGAA CACrCTITCC 
11301 TOOIGAAOQQ TSQOGAGGAC CTQOQOCaGS AOCAOOGOOT GGAOCftQClC 
11351 TPOCaGeiCA TSAATSQGAT OCTOQOCXaA GACICCGOCT QCAQOCaGAG 
11401 OSOCCTSCRO CTSROGAOCT ATAGCOPTST OCXXATOAOC TCCAOTOA-IC 
11451 CCASQQCAOC GCOGTSTGAA TATAAAGATT QOCTOACAAA AATOIXaOGA 
11501 AAACATGATS TrGGAOCTTA CATOCEAATG TA15AA0Q0CXS CTftATOOlAC 
11551 TGAAACAOTC ACOXCPITEA GAAAAOSAGA AAOrAAAfflG OCTOCTOATC 
11601 TCTTAAA0C3G GGOCTTOOro AOGATGAGTA CAAQCOCTCA GGCmoCIB 
11651 GOQCKCaCV OCSaClTOOC CAGCroiCAC GCICTOATAT GCA3CA0CCA 
11701 CTOGATOCTC GGQATroOAG ACAGACATCT QAACAACTIT ATOSIOSCXSl 
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11751 TGGAGACTGG CGGCGTGATC GGGATCGACT TTGGGCATCC GTITOGATCC 

11801 GCTACACAGT TPCTGCCAGT CCCTGAGTTG ATOCCTriTC GQCT7VACTCG 

11851 (XAGTITATC AATCTC5ATC3T TACCAATGAA AGAAACX5GGC CTTATCTACA 

11901 GCATCATCGT ACACGCACTC CGGGCCTTCC GCTCAGACCC TOGCCTOCIC 

11951 ACCAACACCA TCGATGTC3TT TCTCAAGGAG CCCTCCTITC AITOCSAAAAA 

12001 TTTTGAACAG AAAATGCTGA AAAAAQGAGG GTCATOGATT CAAGAAATAA 

12051 ATCTTGCTSA AAAAAATTGG TACCCCCGAC AGAAAATATC TTACGCTAAG 

12101 AGAAAGTTAG CAGGTGCCAA TCCAGCAGTC ATTACTTOTO ATCAGCTACT 

12151 OCIGGGTCAT GAGAAGGCXX: CIX3CCITCAG AGACTATOTO GCTOTOGCAC 

12201 GAGGAAGCAA AGATCACAAC ATTCGTCCCC AAGAACCAGA GA<nGGGCTr 

12251 TCAGAAGAGA CTCAAGTGAA GTGCCTCATG GACCAQGCAA CAGACXXX^^ 

12301 CATCCTTQGC AGAACCTGGG AAGGATGGGA GCCCTGGATC TCAGGTCTOT 

12351 GGGAGTCTGC AGATAGAAAG CATTACAITC TTTAAAGAAT CTACTATACT 

12401 TGGITOGCAG CATTCCATGA GCTGATTTTC CTCAAACACT AAAGAGAAAT 

12451 GTCrrrrGTC ctacagtitc gtagcatgag tttaaatcaa gattatcatg 

12501 ACTAAATGTX3 TATQGGTTAA ATCAAAGATA AGGTTATAGT AACATCAAAG 

12551 ATTAGCSTGAG GriTATAGAA AGATAGAOAT CCAQQCTrAC CAAAOTATTA 

12601 AGTCAAGAAT ATAATATGTG ATCAQCTITC AAASCATITA CAAGTOCIGC 

12651 AAGirAGTOA AACAGCTGTC TCOSTAAATG GAGGAAATGT GQQGAAGCCT 

12701 TX3GAATQCXX: I TC JGGTIUT QQCACATIXX5 AAAGOCACT CA^^ 

12751 CArrauXAAG ATTTTOGGAG AGTAAAGCTA 
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